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Series  Preface 


Mathematics  is  playing  an  ever  more  important  role  in  the  physical  and 
biological  sciences,  provoking  a  blurring  of  boundaries  between  scientific 
disciplines  and  a  resurgence  of  interest  in  the  modern  as  well  as  the  classical 
techniques  of  applied  mathematics.  This  renewal  of  interest,  both  in  research 
and  teaching,  has  led  to  the  establishment  of  the  series:  Texts  in  Applied 
Mathematics  (TAM). 

The  development  of  new  courses  is  a  natural  consequence  of  a  high  level 
of  excitement  on  the  research  frontier  as  newer  techniques,  such  as  numerical 
and  symbolic  computer  systems,  dynamical  systems,  and  chaos,  mix  with  and 
reinforce  the  traditional  methods  of  applied  mathematics.  Thus,  the  purpose 
of  this  textbook  series  is  to  meet  the  current  and  future  needs  of  these 
advances  and  encourage  the  teaching  of  new  courses. 

TAM  will  publish  textbooks  suitable  for  use  in  advanced  undergraduate 
and  beginning  graduate  courses,  and  will  complement  the  Applied  Mathe¬ 
matical  Sciences  ( AMS )  series,  which  will  focus  on  advanced  textbooks  and 
research  level  monographs. 


Preface  to  the  Fourth  Edition 


There  are  two  major  changes  in  the  Fourth  Edition  of  Differential  Equations 
and  Their  Applications.  The  first  concerns  the  computer  programs  in  this  text. 
In  keeping  with  recent  trends  in  computer  science,  we  have  replaced  all  the 
APL  programs  with  Pascal  and  C  programs.  The  Pascal  programs  appear  in 
the  text  in  place  of  the  APL  programs,  where  they  are  followed  by  the  Fortran 
programs,  while  the  C  programs  appear  in  Appendix  C. 

The  second  change,  in  response  to  many  readers’  suggestions,  is  the  in¬ 
clusion  of  a  new  chapter  (Chapter  6)  on  Sturm-Liouville  boundary  value 
problems.  Our  goal  in  this  chapter  is  not  to  present  a  whole  lot  of  technical 
material.  Rather  it  is  to  show  that  the  theory  of  Fourier  series  presented  in 
Chapter  5  is  not  an  isolated  theory  but  is  part  of  a  much  more  general  and 
beautiful  theory  which  encompasses  many  of  the  key  ideas  of  linear  algebra. 

To  accomplish  this  goal  we  have  included  some  additional  material  from 
linear  algebra.  In  particular,  we  have  introduced  the  notions  of  inner  product 
spaces  and  self-adjoint  matrices,  proven  that  the  eigenvalues  of  a  self-adjoint 
matrix  are  real,  and  shown  that  all  self-adjoint  matrices  possess  an  ortho¬ 
normal  basis  of  eigenvectors.  These  results  are  at  the  heart  of  Sturm-Liouville 
theory. 

I  wish  to  thank  Robert  Giresi  for  writing  the  Pascal  and  C  programs. 


New  York  City 
May,  1992 


Martin  Braun 


Preface  to  the  Third  Edition 


There  are  three  major  changes  in  the  Third  Edition  of  Differential  Equations 
and  Their  Applications.  First,  we  have  completely  rewritten  the  section  on 
singular  solutions  of  differential  equations.  A  new  section,  2.8.1,  dealing 
with  Euler  equations  has  been  added,  and  this  section  is  used  to  motivate  a 
greatly  expanded  treatment  of  singular  equations  in  sections  2.8.2  and  2.8.3. 

Our  second  major  change  is  the  addition  of  a  new  section,  4.9,  dealing 
with  bifurcation  theory,  a  subject  of  much  current  interest.  We  felt  it 
desirable  to  give  the  reader  a  brief  but  nontrivial  introduction  to  this 
important  topic. 

Our  third  major  change  is  in  Section  2.6,  where  we  have  switched  to  the 
metric  system  of  units.  This  change  was  requested  by  many  of  our  readers. 

In  addition  to  the  above  changes,  we  have  updated  the  material  on 
population  models,  and  have  revised  the  exercises  in  this  section.  Minor 
editorial  changes  have  also  been  made  throughout  the  text. 


New  York  City 
November ,  1982 


Martin  Braun 


Preface  to  the  First  Edition 


This  textbook  is  a  unique  blend  of  the  theory  of  differential  equations  and 
their  exciting  application  to  “real  world”  problems.  First,  and  foremost,  it 
is  a  rigorous  study  of  ordinary  differential  equations  and  can  be  fully 
understood  by  anyone  who  has  completed  one  year  of  calculus.  However, 
in  addition  to  the  traditional  applications,  it  also  contains  many  exciting 
“real  life”  problems.  These  applications  are  completely  self  contained. 
First,  the  problem  to  be  solved  is  outlined  clearly,  and  one  or  more 
differential  equations  are  derived  as  a  model  for  this  problem.  These 
equations  are  then  solved,  and  the  results  are  compared  with  real  world 
data.  The  following  applications  are  covered  in  this  text. 

1.  In  Section  1.3  we  prove  that  the  beautiful  painting  “Disciples  of 
Emmaus”  which  was  bought  by  the  Rembrandt  Society  of  Belgium  for 
$170,000  was  a  modern  forgery. 

2.  In  Section  1.5  we  derive  differential  equations  which  govern  the 
population  growth  of  various  species,  and  compare  the  results  predicted  by 
our  models  with  the  known  values  of  the  populations. 

3.  In  Section  1.6  we  derive  differential  equations  which  govern  the  rate  at 
which  farmers  adopt  new  innovations.  Surprisingly,  these  same  differential 
equations  govern  the  rate  at  which  technological  innovations  are  adopted  in 
such  diverse  industries  as  coal,  iron  and  steel,  brewing,  and  railroads. 

4.  In  Section  1.7  we  try  to  determine  whether  tightly  sealed  drums  filled 
with  concentrated  waste  material  will  crack  upon  impact  with  the  ocean 
floor.  In  this  section  we  also  describe  several  tricks  for  obtaining  informa¬ 
tion  about  solutions  of  a  differential  equation  that  cannot  be  solved 
explicitly. 
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5.  In  Section  2.7  we  derive  a  very  simple  model  of  the  blood  glucose 
regulatory  system  and  obtain  a  fairly  reliable  criterion  for  the  diagnosis  of 
diabetes. 

6.  Section  4.5  describes  two  applications  of  differential  equations  to 
arms  races  and  actual  combat.  In  Section  4.5.1  we  discuss  L.  F.  Richard¬ 
son’s  theory  of  the  escalation  of  arms  races  and  fit  his  model  to  the  arms 
race  which  led  eventually  to  World  War  I.  This  section  also  provides  the 
reader  with  a  concrete  feeling  for  the  concept  of  stability.  In  Section  4.5.2 
we  derive  two  Lanchestrian  combat  models,  and  fit  one  of  these  models, 
with  astonishing  accuracy,  to  the  battle  of  Iwo  Jima  in  World  War  II. 

7.  In  Section  4.10  we  show  why  the  predator  portion  (sharks,  skates,  rays, 
etc.)  of  all  fish  caught  in  the  port  of  Fiume,  Italy  rose  dramatically  during 
the  years  of  World  War  I.  The  theory  we  develop  here  also  has  a 
spectacular  application  to  the  spraying  of  insecticides. 

8.  In  Section  4.11  we  derive  the  “principle  of  competitive  exclusion,” 
which  states,  essentially,  that  no  two  species  can  earn  their  living  in  an 
identical  manner. 

9.  In  Section  4.12  we  study  a  system  of  differential  equations  which 
govern  the  spread  of  epidemics  in  a  population.  This  model  enables  us  to 
prove  the  famous  “threshold  theorem  of  epidemiology,”  which  states  that 
an  epidemic  will  occur  only  if  the  number  of  people  susceptible  to  the 
disease  exceeds  a  certain  threshold  value.  We  also  compare  the  predictions 
of  our  model  with  data  from  an  actual  plague  in  Bombay. 

10.  In  Section  4.13  we  derive  a  model  for  the  spread  of  gonorrhea  and 
prove  that  either  this  disease  dies  out,  or  else  the  number  of  people  who 
have  gonorrhea  will  ultimately  approach  a  fixed  value. 

This  textbook  also  contains  the  following  important,  and  often  unique 
features. 

1.  In  Section  1.10  we  give  a  complete  proof  of  the  existence-uniqueness 
theorem  for  solutions  of  first-order  equations.  Our  proof  is  based  on  the 
method  of  Picard  iterates,  and  can  be  fully  understood  by  anyone  who  has 
completed  one  year  of  calculus. 

2.  In  Section  1.11  we  show  how  to  solve  equations  by  iteration.  This 
section  has  the  added  advantage  of  reinforcing  the  reader’s  understanding 
of  the  proof  of  the  existence-uniqueness  theorem. 

3.  Complete  Fortran  and  APL  programs  are  given  for  every  computer 
example  in  the  text.  Computer  problems  appear  in  Sections  1.13-1.17, 
which  deal  with  numerical  approximations  of  solutions  of  differential 
equations;  in  Section  1.11,  which  deals  with  solving  the  equations  x=f(x ) 
and  g(x)  =  0;  and  in  Section  2.8,  where  we  show  how  to  obtain  a  power- 
series  solution  of  a  differential  equation  even  though  we  cannot  explicitly 
solve  the  recurrence  formula  for  the  coefficients. 

4.  A  self-contained  introduction  to  the  computing  language  APL  is 
presented  in  Appendix  C.  Using  this  appendix  we  have  been  able  to  teach 
our  students  APL  in  just  two  lectures. 
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5.  Modesty  aside,  Section  2.12  contains  an  absolutely  super  and  unique 
treatment  of  the  Dirac  delta  function.  We  are  very  proud  of  this  section 
because  it  eliminates  all  the  ambiguities  which  are  inherent  in  the  tradi¬ 
tional  exposition  of  this  topic. 

6.  All  the  linear  algebra  pertinent  to  the  study  of  systems  of  equations  is 
presented  in  Sections  3. 1-3.7.  One  advantage  of  our  approach  is  that  the 
reader  gets  a  concrete  feeling  for  the  very  important  but  extremely  abstract 
properties  of  linear  independence,  spanning,  and  dimension.  Indeed,  many 
linear  algebra  students  sit  in  on  our  course  to  find  out  what’s  really  going 
on  in  their  course. 

Differential  Equations  and  Their  Applications  can  be  used  for  a  one-  or 
two-semester  course  in  ordinary  differential  equations.  It  is  geared  to  the 
student  who  has  completed  two  semesters  of  calculus.  Traditionally,  most 
authors  present  a  “suggested  syllabus”  for  their  textbook.  We  will  not  do  so 
here,  though,  since  there  are  already  more  than  twenty  different  syllabi  in 
use.  Suffice  it  to  say  that  this  text  can  be  used  for  a  wide  variety  of  courses 
in  ordinary  differential  equations. 

I  greatly  appreciate  the  help  of  the  following  people  in  the  preparation 
of  this  manuscript:  Douglas  Reber  who  wrote  the  Fortran  programs, 
Eleanor  Addison  who  drew  the  original  figures,  and  Kate  MacDougall, 
Sandra  Spinacci,  and  Miriam  Green  who  typed  portions  of  this  manu¬ 
script. 

I  am  grateful  to  Walter  Kaufmann-Biihler,  the  mathematics  editor  at 
Springer- Verlag,  and  Elizabeth  Kaplan,  the  production  editor,  for  their 
extensive  assistance  and  courtesy  during  the  preparation  of  this 
manuscript.  It  is  a  pleasure  to  work  with  these  true  professionals. 

Finally,  I  am  especially  grateful  to  Joseph  P.  LaSalle  for  the  encourage¬ 
ment  and  help  he  gave  me.  Thanks  again,  Joe. 


New  York  City 
July ,  1976 


Martin  Braun 
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First-order  differential  equations 


1.1  Introduction 


This  book  is  a  study  of  differential  equations  and  their  applications.  A  dif¬ 
ferential  equation  is  a  relationship  between  a  function  of  time  and  its  de¬ 
rivatives.  The  equations 


and 


❖ 

dt 


=  3y2sin(/+y) 


d*y 

~dS 


=  e~y  +  t  + 


are  both  examples  of  differential  equations.  The  order  of  a  differential 
equation  is  the  order  of  the  highest  derivative  of  the  function  y  that  ap¬ 
pears  in  the  equation.  Thus  (i)  is  a  first-order  differential  equation  and  (ii) 
is  a  third-order  differential  equation.  By  a  solution  of  a  differential  equa¬ 
tion  we  will  mean  a  continuous  function  y(t)  which  together  with  its  de¬ 
rivatives  satisfies  the  relationship.  For  example,  the  function 

y(/)  =  2sin/-}cos2/ 

is  a  solution  of  the  second-order  differential  equation 

d2y 

— -  +y  =  cos2/ 
dt 2 

I  cos 2/) -H  (2 sin/  —  |cos2/) 

=  (  -  2  sin  /  +  f  cos  2  /)  +  2  sin  /  -  }  cos  2  /  =  cos  2  /. 


since 

-^-(2sin/- 

dtlX 
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Differential  equations  appear  naturally  in  many  areas  of  science  and  the 
humanities.  In  this  book,  we  will  present  serious  discussions  of  the  applica¬ 
tions  of  differential  equations  to  such  diverse  and  fascinating  problems  as 
the  detection  of  art  forgeries,  the  diagnosis  of  diabetes,  the  increase  in  the 
percentage  of  sharks  present  in  the  Mediterranean  Sea  during  World  War 
I,  and  the  spread  of  gonorrhea.  Our  purpose  is  to  show  how  researchers 
have  used  differential  equations  to  solve,  or  try  to  solve,  real  life  problems. 
And  while  we  will  discuss  some  of  the  great  success  stories  of  differential 
equations,  we  will  also  point  out  their  limitations  and  document  some  of 
their  failures. 

1.2  First-order  linear  differential  equations 

We  begin  by  studying  first-order  differential  equations  and  we  will  assume 
that  our  equation  is,  or  can  be  put,  in  the  form 

f  =/('./>•  (i) 

The  problem  before  us  is  this:  Given /(/,>>)  find  all  functions  y(t)  which 
satisfy  the  differential  equation  (1).  We  approach  this  problem  in  the 
following  manner.  A  fundamental  principle  of  mathematics  is  that  the  way 
to  solve  a  new  problem  is  to  reduce  it,  in  some  manner,  to  a  problem  that 
we  have  already  solved.  In  practice  this  usually  entails  successively  sim¬ 
plifying  the  problem  until  it  resembles  one  we  have  already  solved.  Since 
we  are  presently  in  the  business  of  solving  differential  equations,  it  is  advis¬ 
able  for  us  to  take  inventory  and  list  all  the  differential  equations  we  can 
solve.  If  we  assume  that  our  mathematical  background  consists  of  just  ele¬ 
mentary  calculus  then  the  very  sad  fact  is  that  the  only  first-order  differen¬ 
tial  equation  we  can  solve  at  present  is 

dy 

(2) 

where  g  is  any  integrable  function  of  time.  To  solve 
integrate  both  sides  with  respect  to  t,  which  yields 

7(0  =  /  g{t)dt  +  c. 

Here  c  is  an  arbitrary  constant  of  integration,  and  by 

anti-derivative  of  g,  that  is,  a  function  whose  derivative  is  g.  Thus,  to  solve 
any  other  differential  equation  we  must  somehow  reduce  it  to  the  form  (2). 
As  we  will  see  in  Section  1.9,  this  is  impossible  to  do  in  most  cases.  Hence, 
we  will  not  be  able,  without  the  aid  of  a  computer,  to  solve  most  differen¬ 
tial  equations.  It  stands  to  reason,  therefore,  that  to  find  those  differential 
equations  that  we  can  solve,  we  should  start  with  very  simple  equations 


Equation  (2)  simply 


f  g(0dt 


we  mean  an 
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and  not  ones  like 

—  =  z>sin(/  — 37VM  ) 

dt 

(which  incidentally,  cannot  be  solved  exactly).  Experience  has  taught  us 
that  the  “simplest”  equations  are  those  which  are  linear  in  the  dependent 
variable  y. 

Definition.  The  general  first-order  linear  differential  equation  is 

+a(t)y  =  b(t ).  (3) 

Unless  otherwise  stated,  the  functions  a(t )  and  b{t)  are  assumed  to  be 
continuous  functions  of  time.  We  single  out  this  equation  and  call  it  lin¬ 
ear  because  the  dependent  variable  y  appears  by  itself,  that  is,  no  terms 
such  as  e~y,  y3  or  siny  etc.  appear  in  the  equation.  For  example  dy / dt 
=y 2  +  sin  t  and  dy  /  dt  =  cosy  +  t  are  both  nonlinear  equations  because  of 
they2  and  cosy  terms  respectively. 

Now  it  is  not  immediately  apparent  how  to  solve  Equation  (3).  Thus,  we 
simplify  it  even  further  by  setting  b(t)  —  0. 


Definition.  The  equation 

-jfi+a(t)y  =  0  (4) 

is  called  the  homogeneous  first-order  linear  differential  equation,  and 
Equation  (3)  is  called  the  nonhomogeneous  first-order  linear  differential 
equation  for  b(t)  not  identically  zero. 


Fortunately,  the  homogeneous  equation  (4)  can  be  solved  quite  easily. 
First,  divide  both  sides  of  the  equation  by  y  and  rewrite  it  in  the  form 


Second,  observe  that 


dy_ 

dt 


y 


a(t). 


dt 

y 


where  by  ln|y(/)|  we  mean  the  natural  logarithm  of  |y(/)|.  Hence  Equation 
(4)  can  be  written  in  the  form 


j\n\y{t)\  =  -  ci(t). 
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But  this  is  Equation  (2)  “essentially”  since  we  can  integrate  both  sides  of 
(5)  to  obtain  that 


ln|y(/)|  =  -  J a(t)dt  +  cx 

where  cx  is  an  arbitrary  constant  of  integration.  Taking  exponentials  of 
both  sides  yields 


or 


|y(0|  =  exp 


=  cexp 


y  (t)zxp^j  a(t)dt^ 


=  c. 


(6) 


Now,  y (7)  exp^  J a(t)dt j  is  a  continuous  function  of  time  and  Equation  (6) 

states  that  its  absolute  value  is  constant.  But  if  the  absolute  value  of  a  con¬ 
tinuous  function  g(t )  is  constant  then  g  itself  must  be  constant.  To  prove 
this  observe  that  if  g  is  not  constant,  then  there  exist  two  different  times  tx 
and  t2  for  which  g(tY)  =  c  and  g(t2)—  —  c.  By  the  intermediate  value  theo¬ 
rem  of  calculus  g  must  achieve  all  values  between  —  c  and  +  c  which  is  im¬ 
possible  if  |g(0|  =  c.  Hence,  we  obtain  the  equation  y(t)txp^j  a(t)dt^  =  c 


y(t)  =  cexp^  —  J  a(t)dt j.  (7) 

Equation  (7)  is  said  to  be  the  general  solution  of  the  homogeneous  equa¬ 
tion  since  every  solution  of  (4)  must  be  of  this  form.  Observe  that  an  arbi¬ 
trary  constant  c  appears  in  (7).  This  should  not  be  too  surprising.  Indeed, 
we  will  always  expect  an  arbitrary  constant  to  appear  in  the  general  solu¬ 
tion  of  any  first-order  differential  equation.  To  wit,  if  we  are  given  dy  /  dt 
and  we  want  to  recover  y(t),  then  we  must  perform  an  integration,  and 
this,  of  necessity,  yields  an  arbitrary  constant.  Observe  also  that  Equation 
(4)  has  infinitely  many  solutions;  for  each  value  of  c  we  obtain  a  distinct 
solution  y(t). 


Example  1.  Find  the  general  solution  of  the  equation  (dy / dt)  +  2ty  =  0. 
Solution.  Here  a(t)  —  2t  so  that y(t)  =  cexp(  —  f  2 tdt)  =  ce~/2. 


Example  2.  Determine  the  behavior,  as  f-»oo,  of  all  solutions  of  the  equa¬ 
tion  (dy  /  dt)  +  ay  =  0,  a  constant. 

Solution.  The  general  solution  is  y(t)  =  cexp|—  Ja dtj  =  ce~at.  Hence  if 

a  <  0,  all  solutions,  with  the  exception  of  y  =  0,  approach  infinity,  and  if  a 
>0,  all  solutions  approach  zero  as  t-^oo. 


4 


1.2  First-order  linear  differential  equations 


In  applications,  we  are  usually  not  interested  in  all  solutions  of  (4). 
Rather,  we  are  looking  for  the  specific  solution  y(t)  which  at  some  initial 
time  t0  has  the  value  y0.  Thus,  we  want  to  determine  a  function  y(t)  such 
that 


—  +a(t)y  =  0,  y{t0)=y0.  (8) 

Equation  (8)  is  referred  to  as  an  initial-value  problem  for  the  obvious  rea¬ 
son  that  of  the  totality  of  all  solutions  of  the  differential  equation,  we  are 
looking  for  the  one  solution  which  initially  (at  time  t0)  has  the  value  y0.  To 
find  this  solution  we  integrate  both  sides  of  (5)  between  t0  and  t.  Thus 


f  4-lnly(s)lds=-  f  a(s)ds 

Jt0 

and,  therefore 


ln|^(/)|-ln|^(;0)|  =  ln 


y(t) 

y(t  o) 


Taking  exponentials  of  both  sides  of  this  equation  we  obtain  that 


y(t) 

y(*o) 


or 


y«) 

y(‘o) 


exp 


f  a(s)ds 
*0 


The  function  inside  the  absolute  value  sign  is  a  continuous  function  of 
time.  Thus,  by  the  argument  given  previously,  it  is  either  identically  +  1  or 
identically  —  1.  To  determine  which  one  it  is,  evaluate  it  at  the  point  /0; 
since 


we  see  that 


f  toa(s)ds 
(o 


f  a(s)ds 
*0 


Hence 


y(t)=y(t0)Qx p 


5 


1  First-order  differential  equations 


Example  3.  Find  the  solution  of  the  initial-value  problem 

4) l 

dt 

Solution.  Here  a(t)  =  sint  so  that 


+  (sin/)>'  =  0,  >'(0)  =  f. 


y(t)  =  lexpl-J  sinsds)  =  le(cos,)  *. 


Example  4.  Find  the  solution  of  the  initial-value  problem 

^(1)  =  2. 


1+^=0, 


2 

Solution.  Her ea(t)  =  et  so  that 


y(t)  =  2exp^  —  J  esl ds 


Now,  at  first  glance  this  problem  would  seem  to  present  a  very  serious  dif- 
ficulty  in  that  we  cannot  integrate  the  function  es  directly.  However,  this 
solution  is  equally  as  valid  and  equally  as  useful  as  the  solution  to  Example 
3.  The  reason  for  this  is  twofold.  First,  there  are  very  simple  numerical 
schemes  to  evaluate  the  above  integral  to  any  degree  of  accuracy  with  the 
aid  of  a  computer.  Second,  even  though  the  solution  to  Example  3  is  given 
explicitly,  we  still  cannot  evaluate  it  at  any  time  t  without  the  aid  of  a  table 
of  trigonometric  functions  and  some  sort  of  calculating  aid,  such  as  a  slide 
rule,  electronic  calculator  or  digital  computer. 

We  return  now  to  the  nonhomogeneous  equation 

dy 


dt 


+  a(t)y  =  b(t). 


It  should  be  clear  from  our  analysis  of  the  homogeneous  equation  that  the 
way  to  solve  the  nonhomogeneous  equation  is  to  express  it  in  the  form 

^  (“something”)  =  b(t) 

and  then  to  integrate  both  sides  to  solve  for  “something”.  However,  the  ex¬ 
pression  (dy  /  dt)  +  a(t)y  does  not  appear  to  be  the  derivative  of  some  sim¬ 
ple  expression.  The  next  logical  step  in  our  analysis  therefore  should  be  the 
following:  Can  we  make  the  left  hand  side  of  the  equation  to  be  d/ dt  of 
“something”?  More  precisely,  we  can  multiply  both  sides  of  (3)  by  any 
continuous  function  /i(t)  to  obtain  the  equivalent  equation 


dy 

+  a(t) 


(9) 
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(By  equivalent  equations  we  mean  that  every  solution  of  (9)  is  a  solution  of 
(3)  and  vice-versa.)  Thus,  can  we  choose  //,(/)  so  that  [i(t)(dy  /  dt)  + 
a(t)  is  the  derivative  of  some  simple  expression?  The  answer  to  this 
question  is  yes,  and  is  obtained  by  observing  that 


Hence,  [i(t)(dy / dt)  +  a(t)  fi(t)y  will  be  equal  to  the  derivative  of  ti(t)y  if 
and  only  if  d\i(t)/ dt  =  a(t)  But  this  is  a  first-order  linear  homoge¬ 
neous  equation  for  fi(t)9  i.e.  (dp/dt)  —  a(t)  fjL  =  0  which  we  already  know 
how  to  solve,  and  since  we  only  need  one  such  function  ji(t)  we  set  the 
constant  c  in  (7)  equal  to  one  and  take 

fi(t)  =  exp^J  a(t)dtj. 


For  this  j u(t),  Equation  (9)  can  be  written  as 

H(t)y  =  n(t)b(t).  (10) 

To  obtain  the  general  solution  of  the  nonhomogeneous  equation  (3),  that 
is,  to  find  all  solutions  of  the  nonhomogeneous  equation,  we  take  the  indef¬ 
inite  integral  (anti-derivative)  of  both  sides  of  (10)  which  yields 


/  ji(t)b(t)dt  +  c 


or 


y: “  (f  vWHt) dt  +  c)  - exp(  —  j  a(t)dt^^j  [x(t)b(t)dt  +  cj.  (11) 


Alternately,  if  we  are  interested  in  the  specific  solution  of  (3)  satisfying 
the  initial  condition  y(t0)=y0,  that  is,  if  we  want  to  solve  the  initial- value 
problem 

dy 

—  +  a(t)y  =  b(t ),  y(t0)=y0 


then  we  can  take  the  definite  integral  of  both  sides  of  (10)  between  t0  and  t 
to  obtain  that 


or 


f  fi(s)b(s)ds 

Jto 

/A('o)>;o+  f  li(s)b(s)ds 


ll(t) 


Remark  1.  Notice  how  we  used  our  knowledge  of  the  solution  of  the  ho¬ 
mogeneous  equation  to  find  the  function  \i(t)  which  enables  us  to  solve  the 
nonhomogeneous  equation.  This  is  an  excellent  illustration  of  how  we  use 
our  knowledge  of  the  solution  of  a  simpler  problem  to  solve  a  harder  prob¬ 
lem. 
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Remark  2.  The  function  jtt(0  =  exp a(t)dtj  is  called  an  integrating  factor 

for  the  nonhomogeneous  equation  since  after  multiplying  both  sides  by 
j u(0  we  can  immediately  integrate  the  equation  to  find  all  solutions. 

Remark  3.  The  reader  should  not  memorize  formulae  (11)  and  (12). 
Rather,  we  will  solve  all  nonhomogeneous  equations  by  first  multiplying 
both  sides  by  fi(t ),  by  writing  the  new  left-hand  side  as  the  derivative  of 
li(t)y(t),  and  then  by  integrating  both  sides  of  the  equation. 


Remark  4.  An  alternative  way  of  solving  the  initial-value  problem  (dy/dt) 
+  a(t)y  =  b(t),  y(t0)=y0  is  to  find  the  general  solution  (11)  of  (3)  and  then 
use  the  initial  condition  y(r0)=y0  to  evaluate  the  constant  c.  If  the  function 
\i (t)b(t)  cannot  be  integrated  directly,  though,  then  we  must  take  the  defi¬ 
nite  integral  of  (10)  to  obtain  (12),  and  this  equation  is  then  approximated 
numerically. 


Example  5.  Find  the  general  solution  of  the  equation  (dy  / dt)-2ty  =  t. 
Solution.  Here  a(0  =  —  2/  so  that 

fi(t)  =  exp^J  a(t)dt^  =  exp^  —  J  2  tdt^  =  e~‘2. 

Multiplying  both  sides  of  the  equation  by  fx(t)  we  obtain  the  equivalent 


equation 

\  *  -i1  d  -i1  .  _,2 

J  =  te  or  —  e  y  =  te  . 

Hence, 

_  -t2 

li 

to* 

1 

<*> 

te~*2  dt  +  c  =  — - - H  c 

and 

y(i)=-\  +  ce'2. 

Example  6.  Find  the  solution  of  the  initial- value  problem 

dy 


dt 


+  2ty  =  t,  y(  1)  =  2. 


Solution.  Here  a(t)  =  2t  so  that 

/x(t)  =  exp^  j  a(t)dt^  =  exp^  j  2  tdt^  =  e*1. 


Multiplying  both  sides  of  the  equation  by  n(t)  we  obtain  that 

e'2(^i+2ty)=te'2  or  ^(e'-K)=,e'2- 
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Hence, 

ii 

05 

05 

-«|-3 

so  that 

II 

**  *— m 

<*> 

Consequently, 

t2  -1  ^ 

ey  —  2e=  — 

and 

1  3e  _,2 
>'=-,+  «  = 

se 5  ds 


e_ 

2 


1  +  -e 
2*2*' 


Example  7.  Find  the  solution  of  the  initial- value  problem 

dy  \ 

Tt+y= TT7’  H2)-x 

Solution.  Here  a(t)=  1,  so  that 

fi(t)  =  exp^Ja(t)dtJ  =  exp^J  1  dt^  =  e*. 

Multiplying  both  sides  of  the  equation  by  fi(t)  we  obtain  that 

t(  &  ,  \  el  d  t  el 

el  —  + y  = - -  or  —  ey  = - - . 

dt  )  1  +  /2  1  +  t2 


Hence 


so  that 


and 


ds 


esy(s )  ds 


-f 


1  +  S' 


ds , 


e'y  — 


-r 


1  +  5' 


y  =  e  ' 


3e2  + 


i: 


1  +  s4 


ds 


Exercises 

In  each  of  Problems  1-7  find  the  general  solution  of  the  given  differential 
equation. 


dy 

1.  ^-+ycos/  =  0 


dy  — 

2.  -tt  +.yV/  sin/  =  0 
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3  —  + 

dt  1  +  ,2' 

5.  %+t2y  =  1 
dt 

ndy  t 
7 .  ~r  + - r  j 

dt  1  +  ;2J 


1  +  r 


4.  <^-+y  =  te' 
dt 

6. 

dt  J 


l'TT?y 


In  each  of  Problems  8-14,  find  the  solution  of  the  given  initial-value  prob¬ 
lem. 


8.  ^  +  Vl  +  /2y  =  0,  y( 0)=V5 


9.  -I-  V 1  +  t2  e  ly  =  0,  y(0)  = 


10.  ^  +  V\  +  t2  e~'y  =  0.  y( 0)  =  0  H.—-2 ty-t,  >’(0)=1 


12.  ^-  +  /y=l  +  r,  y{\)  =  0 


^  ,  1 

13.  —r  +  v  = - - 

dt  '  i  +  ,2 


>(D- 


14.  -dj-2ty=  1,  y(0)=l 

15.  Find  the  general  solution  of  the  equation 

(\  +  t2)^+ty  =  {l  +  t2)5/2. 
( Hint :  Divide  both  sides  of  the  equation  by  1  +  /2.) 

16.  Find  the  solution  of  the  initial-value  problem 

/i  *  2  \  dy 


(1  +  <  ) “j7  +  4(y  =  /,  y(l)  =  *. 


17.  Find  a  continuous  solution  of  the  initial-value  problem 

y’+y  =  g(t ),  y(0)  =  0 


where 


o(/)as/2,  0<*<1 

10,  />1 


18.  Show  that  every  solution  of  the  equation  (dy  /  dt)+  ay  =  be~ct  where  a  and  c 
are  positive  constants  and  b  is  any  real  number  approaches  zero  as  t  ap¬ 
proaches  infinity. 

19.  Given  the  differential  equation  (dy/dt)  +  a(t)y  =f(t)  with  a(t)  and  /(/)  con¬ 
tinuous  for  -  oo  <  t  <  oo,  a(t)>c>  0,  and  lim^^ /(/)  =  0,  show  that  every 
solution  tends  to  zero  as  t  approaches  infinity. 

When  we  derived  the  solution  of  the  nonhomogeneous  equation  we  tacitly 
assumed  that  the  functions  a(t)  and  b(t)  were  continuous  so  that  we  could 
perform  the  necessary  integrations.  If  either  of  these  functions  was  discon¬ 
tinuous  at  a  point  /1?  then  we  would  expect  that  our  solutions  might  be  dis¬ 
continuous  at  /  =  /,.  Problems  20-23  illustrate  the  variety  of  things  that 
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may  happen.  In  Problems  20-22  determine  the  behavior  of  all  solutions  of 
the  given  differential  equation  as  t—>0,  and  in  Problem  23  determine  the 
behavior  of  all  solutions  as  / 2. 


,ft  dy  ,  l 

20.  -j-  +  —y 
dt  tJ 


dy  .  l  , , 

22.  +  yy  =  cos/  + 


sin/ 


^  1  V7  /2 

21.  ~r  H - y  =  e  t/z 

dt  V7 

23.  +y  tan /  =  sin /cos/. 


1.3  The  Van  Meegeren  art  forgeries 

After  the  liberation  of  Belgium  in  World  War  II,  the  Dutch  Field  Security 
began  its  hunt  for  Nazi  collaborators.  They  discovered,  in  the  records  of  a 
firm  which  had  sold  numerous  works  of  art  to  the  Germans,  the  name  of  a 
banker  who  had  acted  as  an  intermediary  in  the  sale  to  Goering  of  the 
painting  “Woman  Taken  in  Adultery”  by  the  famed  17th  century  Dutch 
painter  Jan  Vermeer.  The  banker  in  turn  revealed  that  he  was  acting  on 
behalf  of  a  third  rate  Dutch  painter  H.  A.  Van  Meegeren,  and  on  May  29, 
1945  Van  Meegeren  was  arrested  on  the  charge  of  collaborating  with  the 
enemy.  On  July  12,  1945  Van  Meegeren  startled  the  world  by  announcing 
from  his  prison  cell  that  he  had  never  sold  “Woman  Taken  in  Adultery”  to 
Goering.  Moreover,  he  stated  that  this  painting  and  the  very  famous  and 
beautiful  “Disciples  at  Emmaus”,  as  well  as  four  other  presumed  Vermeers 
and  two  de  Hooghs  (a  17th  century  Dutch  painter)  were  his  own  work. 
Many  people,  however,  thought  that  Van  Meegeren  was  only  lying  to  save 
himself  from  the  charge  of  treason.  To  prove  his  point,  Van  Meegeren  be¬ 
gan,  while  in  prison,  to  forge  the  Vermeer  painting  “Jesus  Amongst  the 
Doctors”  to  demonstrate  to  the  skeptics  just  how  good  a  forger  of  Vermeer 
he  was.  The  work  was  nearly  completed  when  Van  Meegeren  learned  that 
a  charge  of  forgery  had  been  substituted  for  that  of  collaboration.  He, 
therefore,  refused  to  finish  and  age  the  painting  so  that  hopefully  investiga¬ 
tors  would  not  uncover  his  secret  of  aging  his  forgeries.  To  settle  the  ques¬ 
tion  an  international  panel  of  distinguished  chemists,  physicists  and  art 
historians  was  appointed  to  investigate  the  matter.  The  panel  took  x-rays 
of  the  paintings  to  determine  whether  other  paintings  were  underneath 
them.  In  addition,  they  analyzed  the  pigments  (coloring  materials)  used  in 
the  paint,  and  examined  the  paintings  for  certain  signs  of  old  age. 

Now,  Van  Meegeren  was  well  aware  of  these  methods.  To  avoid  detec¬ 
tion,  he  scraped  the  paint  from  old  paintings  that  were  not  worth  much, 
just  to  get  the  canvas,  and  he  tried  to  use  pigments  that  Vermeer  would 
have  used.  Van  Meegeren  also  knew  that  old  paint  was  extremely  hard, 
and  impossible  to  dissolve.  Therefore,  he  very  cleverly  mixed  a  chemical, 
phenoformaldehyde,  into  the  paint,  and  this  hardened  into  bakelite  when 
the  finished  painting  was  heated  in  an  oven. 
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However,  Van  Meegeren  was  careless  with  several  of  his  forgeries,  and 
the  panel  of  experts  found  traces  of  the  modern  pigment  cobalt  blue.  In 
addition,  they  also  detected  the  phenoformaldehyde,  which  was  not  dis¬ 
covered  until  the  turn  of  the  19th  century,  in  several  of  the  paintings.  On 
the  basis  of  this  evidence  Van  Meegeren  was  convicted,  of  forgery,  on  Oc¬ 
tober  12,  1947  and  sentenced  to  one  year  in  prison.  While  in  prison  he 
suffered  a  heart  attack  and  died  on  December  30,  1947. 

However,  even  following  the  evidence  gathered  by  the  panel  of  experts, 
many  people  still  refused  to  believe  that  the  famed  “Disciples  at  Emmaus” 
was  forged  by  Van  Meegeren.  Their  contention  was  based  on  the  fact  that 
the  other  alleged  forgeries  and  Van  Meegeren’s  nearly  completed  “Jesus 
Amongst  the  Doctors”  were  of  a  very  inferior  quality.  Surely,  they  said,  the 
creator  of  the  beautiful  “Disciples  at  Emmaus”  could  not  produce  such  in¬ 
ferior  pictures.  Indeed,  the  “Disciples  at  Emmaus”  was  certified  as  an 
authentic  Vermeer  by  the  noted  art  historian  A.  Bredius  and  was  bought 
by  the  Rembrandt  Society  for  $170,000.  The  answer  of  the  panel  to  these 
skeptics  was  that  because  Van  Meegeren  was  keenly  disappointed  by  his 
lack  of  status  in  the  art  world,  he  worked  on  the  “Disciples  at  Emmaus” 
with  the  fierce  determination  of  proving  that  he  was  better  than  a  third 
rate  painter.  After  producing  such  a  masterpiece  his  determination  was 
gone.  Moreover,  after  seeing  how  easy  it  was  to  dispose  of  the  “Disciples  at 
Emmaus”  he  devoted  less  effort  to  his  subsequent  forgeries.  This  explana¬ 
tion  failed  to  satisfy  the  skeptics.  They  demanded  a  thoroughly  scientific 
and  conclusive  proof  that  the  “Disciples  at  Emmaus”  was  indeed  a  forgery. 
This  was  done  recently  in  1967  by  scientists  at  Carnegie  Mellon  University, 
and  we  would  now  like  to  describe  their  work. 

The  key  to  the  dating  of  paintings  and  other  materials  such  as  rocks  and 
fossils  lies  in  the  phenomenon  of  radioactivity  discovered  at  the  turn  of  the 
century.  The  physicist  Rutherford  and  his  colleagues  showed  that  the 
atoms  of  certain  “radioactive”  elements  are  unstable  and  that  within  a 
given  time  period  a  fixed  proportion  of  the  atoms  spontaneously  disin¬ 
tegrates  to  form  atoms  of  a  new  element.  Because  radioactivity  is  a  prop¬ 
erty  of  the  atom,  Rutherford  showed  that  the  radioactivity  of  a  substance 
is  directly  proportional  to  the  number  of  atoms  of  the  substance  present. 
Thus,  if  N  (t)  denotes  the  number  of  atoms  present  at  time  /,  then  dN/dt , 
the  number  of  atoms  that  disintegrate  per  unit  time  is  proportional  to  V, 
that  is, 

=  -  XN.  (1) 

dt  v  ’ 

The  constant  A  which  is  positive,  is  known  as  the  decay  constant  of  the 
substance.  The  larger  A  is,  of  course,  the  faster  the  substance  decays.  One 
measure  of  the  rate  of  disintegration  of  a  substance  is  its  half-life  which  is 
defined  as  the  time  required  for  half  of  a  given  quantity  of  radioactive 
atoms  to  decay.  To  compute  the  half-life  of  a  substance  in  terms  of  A, 
assume  that  at  time  /0,  V(70)  =  V0.  Then,  the  solution  of  the  initial-value 
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problem  dN / dt  —  —  XN,  N  (t0)  =  7V0  is 

N(t)  =  A^0exp|—  xj^  =  A^0e — x^/  — 

or  N / N0  =  exp(  —  X(t  — 10)).  Taking  logarithms  of  both  sides  we  obtain  that 

=  (2) 

iy0 

Now,  if  N /Nq=\  then  —  X(t  —  /0)  =  ln^  so  that 

(;-/0)=^  =  ^i.  (3) 

Thus,  the  half-life  of  a  substance  is  In  2  divided  by  the  decay  constant  X. 
The  dimension  of  A,  which  we  suppress  for  simplicity  of  writing,  is  recipro¬ 
cal  time.  If  t  is  measured  in  years  then  X  has  the  dimension  of  reciprocal 
years,  and  if  t  is  measured  in  minutes,  then  X  has  the  dimension  of  recipro¬ 
cal  minutes.  The  half-lives  of  many  substances  have  been  determined  and 
recorded.  For  example,  the  half-life  of  carbon- 14  is  5568  years  and  the 
half-life  of  uranium-238  is  4.5  billion  years. 

Now  the  basis  of  “radioactive  dating”  is  essentially  the  following.  From 
Equation  (2)  we  can  solve  for  t - 10=  \/X\n(N0/ N).  If  t0  is  the  time  the 
substance  was  initially  formed  or  manufactured,  then  the  age  of  the  sub¬ 
stance  is  1  /X\n(N0/ N).  The  decay  constant  X  is  known  or  can  be  com¬ 
puted,  in  most  instances.  Moreover,  we  can  usually  evaluate  N  quite  easily. 
Thus,  if  we  knew  N0  we  could  determine  the  age  of  the  substance.  But  this 
is  the  real  difficulty  of  course,  since  we  usually  do  not  know  N0.  In  some 
instances  though,  we  can  either  determine  N0  indirectly,  or  else  determine 
certain  suitable  ranges  for  N0,  and  such  is  the  case  for  the  forgeries  of  Van 
Meegeren. 

We  begin  with  the  following  well-known  facts  of  elementary  chemistry. 
Almost  all  rocks  in  the  earth’s  crust  contain  a  small  quantity  of  uranium. 
The  uranium  in  the  rock  decays  to  another  radioactive  element,  and  that 
one  decays  to  another  and  another,  and  so  forth  (see  Figure  1)  in  a  series 
of  elements  that  results  in  lead,  which  is  not  radioactive.  The  uranium 
(whose  half-life  is  over  four  billion  years)  keeps  feeding  the  elements 
following  it  in  the  series,  so  that  as  fast  as  they  decay,  they  are  replaced  by 
the  elements  before  them. 

Now,  all  paintings  contain  a  small  amount  of  the  radioactive  element 
lead-210  (210Pb),  and  an  even  smaller  amount  of  radium-226  (226Ra),  since 
these  elements  are  contained  in  white  lead  (lead  oxide),  which  is  a  pigment 
that  artists  have  used  for  over  2000  years.  For  the  analysis  which  follows,  it 
is  important  to  note  that  white  lead  is  made  from  lead  metal,  which,  in 
turn,  is  extracted  from  a  rock  called  lead  ore,  in  a  process  called  smelting. 
In  this  process,  the  lead-210  in  the  ore  goes  along  with  the  lead  metal. 
However,  90-95%  of  the  radium  and  its  descendants  are  removed  with 
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Figure  1.  The  Uranium  series.  (The  times  shown  on  the  arrows  are  the  half-lives  of  each  step.) 
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other  waste  products  in  a  material  called  slag.  Thus,  most  of  the  supply  of 
lead-210  is  cut  off  and  it  begins  to  decay  very  rapidly,  with  a  half-life  of  22 
years.  This  process  continues  until  the  lead-210  in  the  white  lead  is  once 
more  in  radioactive  equilibrium  with  the  small  amount  of  radium  present, 
i.e.  the  disintegration  of  the  lead-210  is  exactly  balanced  by  the  disintegra¬ 
tion  of  the  radium. 

Let  us  now  use  this  information  to  compute  the  amount  of  lead-210  pre¬ 
sent  in  a  sample  in  terms  of  the  amount  originally  present  at  the  time  of 
manufacture.  Let  y(t)  be  the  amount  of  lead-210  per  gram  of  white  lead 
at  time  t,  y0  the  amount  of  lead-210  per  gram  of  white  lead  present  at 
the  time  of  manufacture  t0 ,  and  r(t)  the  number  of  disintegrations  of 
radium-226  per  minute  per  gram  of  white  lead,  at  time  t.  If  X  is  the  decay 
constant  for  lead-210,  then 

^  =  -Xy  +  r(t),  y{t0)  =>>0.  (4) 

Since  we  are  only  interested  in  a  time  period  of  at  most  300  years  we  may 
assume  that  the  radium-226,  whose  half-life  is  1600  years,  remains  con¬ 
stant,  so  that  r(t)  is  a  constant  r.  Multiplying  both  sides  of  the  differential 
equation  by  the  integrating  factor  }i(t)  =  eXt  we  obtain  that 


Hence 


Xty  —  re 


or 

y{t)=  y(1  —  e~x^t~to))-\-yQe~x^t~t(^.  (5) 

A 


Now  y(/)  and  r  can  be  easily  measured.  Thus,  if  we  knewy0  we  could 
use  Equation  (5)  to  compute  (t  —  t0)  and  consequently,  we  could  determine 
the  age  of  the  painting.  As  we  pointed  out,  though,  we  cannot  measure  y0 
directly.  One  possible  way  out  of  this  difficulty  is  to  use  the  fact  that  the 
original  quantity  of  lead-210  was  in  radioactive  equilibrium  with  the  larger 
amount  of  radium-226  in  the  ore  from  which  the  metal  was  extracted.  Let 
us,  therefore,  take  samples  of  different  ores  and  count  the  number  of  disin¬ 
tegrations  of  the  radium-226  in  the  ores.  This  was  done  for  a  variety  of 
ores  and  the  results  are  given  in  Table  1  below.  These  numbers  vary  from 
0.18  to  140.  Consequently,  the  number  of  disintegrations  of  the  lead-210 
per  minute  per  gram  of  white  lead  at  the  time  of  manufacture  will  vary 
from  0.18  to  140.  This  implies  that  j>0  will  also  vary  over  a  very  large  inter¬ 
val,  since  the  number  of  disintegrations  of  lead-210  is  proportional  to  the 
amount  present.  Thus,  we  cannot  use  Equation  (5)  to  obtain  an  accurate, 
or  even  a  crude  estimate,  of  the  age  of  a  painting. 
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Table  1.  Ore  and  ore  concentrate  samples.  All  disintegration  rates 
are  per  gram  of  white  lead. 


Description  and  Source 

Disintegrations  per  minute  of  226Ra 

Ore  concentrate 

(Oklahoma-Kansas) 

4.5 

Crushed  raw  ore 

(S.E.  Missouri) 

2.4 

Ore  concentrate 

(S.E.  Missouri) 

0.7 

Ore  concentrate 

(Idaho) 

2.2 

Ore  concentrate 

(Idaho) 

0.18 

Ore  concentrate 

(Washington) 

140.0 

Ore  concentrate 

(British  Columbia) 

1.9 

Ore  concentrate 

(British  Columbia) 

0.4 

Ore  concentrate 

(Bolivia) 

1.6 

Ore  concentrate 

(Australia) 

1.1 

However,  we  can  still  use  Equation  (5)  to  distinguish  between  a  17th 
century  painting  and  a  modern  forgery.  The  basis  for  this  statement  is  the 
simple  observation  that  if  the  paint  is  very  old  compared  to  the  22  year 
half-life  of  lead,  then  the  amount  of  radioactivity  from  the  lead-210  in  the 
paint  will  be  nearly  equal  to  the  amount  of  radioactivity  from  the  radium 
in  the  paint.  On  the  other  hand,  if  the  painting  is  modern  (approximately 
20  years  old,  or  so)  then  the  amount  of  radioactivity  from  the  lead-210  will 
be  much  greater  than  the  amount  of  radioactivity  from  the  radium. 

We  make  this  argument  precise  in  the  following  manner.  Let  us  assume 
that  the  painting  in  question  is  either  very  new  or  about  300  years  old.  Set 
t-  /o  =  300  years  in  (5).  Then,  after  some  simple  algebra,  we  see  that 

Ay0==Ay(^)^300A-  r(e300X-  1).  (6) 

If  the  painting  is  indeed  a  modern  forgery,  then  Ay0  will  be  absurdly 
large.  To  determine  what  is  an  absurdly  high  disintegration  rate  we  observe 
(see  Exercise  1)  that  if  the  lead-210  decayed  originally  (at  the  time  of 
manufacture)  at  the  rate  of  100  disintegrations  per  minute  per  gram  of 
white  lead,  then  the  ore  from  which  it  was  extracted  had  a  uranium  con¬ 
tent  of  approximately  0.014  per  cent.  This  is  a  fairly  high  concentration  of 
uranium  since  the  average  amount  of  uranium  in  rocks  of  the  earth’s  crust 
is  about  2.7  parts  per  million.  On  the  other  hand,  there  are  some  very  rare 
ores  in  the  Western  Hemisphere  whose  uranium  content  is  2-3  per  cent.  To 
be  on  the  safe  side,  we  will  say  that  a  disintegration  rate  of  lead-210  is  cer¬ 
tainly  absurd  if  it  exceeds  30,000  disintegrations  per  minute  per  gram  of 
white  lead. 

To  evaluate  Ay0,  we  must  evaluate  the  present  disintegration  rate,  Xy(t), 
of  the  lead-210,  the  disintegration  rate  r  of  the  radium-226,  and  e300\  Since 
the  disintegration  rate  of  polonium-210  (210Po)  equals  that  of  lead-210  after 
several  years,  and  since  it  is  easier  to  measure  the  disintegration  rate  of 
polonium-210,  we  substitute  these  values  for  those  of  lead-210.  To  compute 
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e300X,  we  observe  from  (3)  that  X  =  (In  2/22).  Hence 

e  300A  e  (300/22)ln  2  _  2 ( 1 50/ 1 1 ) 

The  disintegration  rates  of  polonium-210  and  radium-226  were  measured 
for  the  “Disciples  at  Emmaus”  and  various  other  alleged  forgeries  and  are 
given  in  Table  2  below. 


Table  2.  Paintings  of  questioned  authorship.  All  disintegration 
rates  are  per  minute,  per  gram  of  white  lead. 


Description 

210Po  disintegration 

226Ra  disintegration 

“Disciples  at  Emmaus” 

8.5 

0.8 

“Washing  of  Feet” 

12.6 

0.26 

“Woman  Reading  Music” 

10.3 

“Woman  Playing  Mandolin” 

8.2 

0.17 

“Lace  Maker” 

1.5 

1.4 

“Laughing  Girl” 

5.2 

6.0 

If  we  now  evaluate  Xy0  from  (6)  for  the  white  lead  in  the  painting  “Disci¬ 
ples  at  Emmaus”  we  obtain  that 

(8.5)2 150/11  -  0.8(2 150/11 -  1) 

=  98,050 

which  is  unacceptably  large.  Thus,  this  painting  must  be  a  modern  forgery. 
By  a  similar  analysis,  (see  Exercises  2-4)  the  paintings  “Washing  of  Feet”, 
“Woman  Reading  Music”  and  “Woman  Playing  Mandolin”  were  indisput¬ 
ably  shown  to  be  faked  Vermeers.  On  the  other  hand,  the  paintings  “Lace 
Maker”  and  “Laughing  Girl”  cannot  be  recently  forged  Vermeers,  as 
claimed  by  some  experts,  since  for  these  two  paintings,  the  polonium-210  is 
very  nearly  in  radioactive  equilibrium  with  the  radium-226,  and  no  such 
equilibrium  has  been  observed  in  any  samples  from  19th  or  20th  century 
paintings. 
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Exercises 

1.  In  this  exercise  we  show  how  to  compute  the  concentration  of  uranium  in  an 
ore  from  the  dpm/(g  of  Pb)  of  the  lead-210  in  the  ore. 

(a)  The  half-life  of  uranium-238  is  4.51  x  109  years.  Since  this  half-life  is  so 
large,  we  may  assume  that  the  amount  of  uranium  in  the  ore  is  constant 
over  a  period  of  two  to  three  hundred  years.  Let  N(t)  denote  the  number  of 
atoms  of  238U  per  gram  of  ordinary  lead  in  the  ore  at  time  t.  Since  the 
lead-210  is  in  radioactive  equilibrium  with  the  uranium-238  in  the  ore,  we 
know  that  dN /  dt=  —\N=  -  100  dpm/g  of  Pb  at  time  t0.  Show  that  there 
are  3.42  X  1017  atoms  of  uranium-238  per  gram  of  ordinary  lead  in  the  ore 
at  time  /0-  (Hint:  1  year  =  525,600  minutes.) 

(b)  Using  the  fact  that  one  mole  of  uranium-238  weighs  238  grams,  and  that 
there  are  6.02  X  1023  atoms  in  a  mole,  show  that  the  concentration  of 
uranium  in  the  ore  is  approximately  0.014  percent. 

For  each  of  the  paintings  2,  3,  and  4  use  the  data  in  Table  2  to  compute 
the  disintegrations  per  minute  of  the  original  amount  of  lead-210  per  gram 
of  white  lead,  and  conclude  that  each  of  these  paintings  is  a  forged 
Vermeer. 

2.  “Washing  of  Feet” 

3.  “Woman  Reading  Music” 

4.  “Woman  Playing  Mandolin” 

5.  The  following  problem  describes  a  very  accurate  derivation  of  the  age  of 
uranium. 

(a)  Let  #238(f)  and  #235(0  denote  the  number  of  atoms  of  238U  and  235U  at 
time  t  in  a  given  sample  of  uranium,  and  let  /  =  0  be  the  time  this  sample 
was  created.  By  the  radioactive  decay  law, 


4:N235  (t)=  ^  N235  (t). 

at  0.707(10) 


Solve  these  equations  for  #238(/)  and  #23s(0  *n  terms  of  their  original  num¬ 
bers  #238(0)  and  #235( 0). 

(b)  In  1946  the  ratio  of  238U/235U  in  any  sample  was  137.8.  Assuming  that 
equal  amounts  of  238U  and  235U  appeared  in  any  sample  at  the  time  of  its 
creation,  show  that  the  age  of  uranium  is  5.96X  109  years.  This  figure  is 
universally  accepted  as  the  age  of  uranium. 

6.  In  a  samarskite  sample  discovered  recently,  there  was  3  grams  of  Thorium 
(232TH).  Thorium  decays  to  lead-208  (208Pb)  through  the  reaction  232Th—»208Pb 
+  6(44He).  It  was  determined  that  0.0376  of  a  gram  of  lead-208  was  produced 
by  the  disintegration  of  the  original  Thorium  in  the  sample.  Given  that  the 
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half-life  of  Thorium  is  13.9  billion  years,  derive  the  age  of  this  samarskite  sam¬ 
ple.  (Hint:  0.0376  grams  of  208Pb  is  the  product  of  the  decay  of  (232/208)  X 
0.0376  grams  of  Thorium.) 

One  of  the  most  accurate  ways  of  dating  archaeological  finds  is  the  method 
of  carbon- 14  (14C)  dating  discovered  by  Willard  Libby  around  1949.  The  basis 
of  this  method  is  delightfully  simple:  The  atmosphere  of  the  earth  is  continu¬ 
ously  bombarded  by  cosmic  rays.  These  cosmic  rays  produce  neutrons  in  the 
earth’s  atmosphere,  and  these  neutrons  combine  with  nitrogen  to  produce  14C, 
which  is  usually  called  radiocarbon,  since  it  decays  radioactively.  Now,  this  ra¬ 
diocarbon  is  incorporated  in  carbon  dioxide  and  thus  moves  through  the  atmo¬ 
sphere  to  be  absorbed  by  plants.  Animals,  in  turn,  build  radiocarbon  into  their 
tissues  by  eating  the  plants.  In  living  tissue,  the  rate  of  ingestion  of  14C  exactly 
balances  the  rate  of  disintegration  of  14C.  When  an  organism  dies,  though,  it 
ceases  to  ingest  carbon- 14  and  thus  its  14C  concentration  begins  to  decrease 
through  disintegration  of  the  14C  present.  Now,  it  is  a  fundamental  assumption 
of  physics  that  the  rate  of  bombardment  of  the  earth’s  atmosphere  by  cosmic 
rays  has  always  been  constant.  This  implies  that  the  original  rate  of  disintegra¬ 
tion  of  the  14C  in  a  sample  such  as  charcoal  is  the  same  as  the  rate  measured 
today.*  This  assumption  enables  us  to  determine  the  age  of  a  sample  of  char¬ 
coal.  Let  N (t)  denote  the  amount  of  carbon- 14  present  in  a  sample  at  time  /, 
and  N0  the  amount  present  at  time  /  =  0  when  the  sample  was  formed.  If  X  de¬ 
notes  the  decay  constant  of  14C  (the  half-life  of  carbon- 14  is  5568  years)  then 
dN(t)/dt=  —  XN  (t),  7V(0)=V0.  Consequently,  V(/)=  N0e~Xt.  Now  the  present 
rate  R(t)  of  disintegration  of  the  14C  in  the  sample  is  given  by  R(t)  =  XN  (t)  = 
XN0e~Xt  and  the  original  rate  of  disintegration  is  R(0)  =  XNo.  Thus,  R(t)/ R( 0) 
=  e~Xt  so  that  /  =  (l/A)ln[/?(0)/ R(t)].  Hence  if  we  measure  /?(/),  the  present 
rate  of  disintegration  of  the  14C  in  the  charcoal,  and  observe  that  R( 0)  must 
equal  the  rate  of  disintegration  of  the  14C  in  a  comparable  amount  of  living 
wood,  then  we  can  compute  the  age  t  of  the  charcoal.  The  following  two  prob¬ 
lems  are  real  life  illustrations  of  this  method. 

7.  Charcoal  from  the  occupation  level  of  the  famous  Lascaux  Cave  in  France 
gave  an  average  count  in  1950  of  0.97  disintegrations  per  minute  per  gram. 
Living  wood  gave  6.68  disintegrations.  Estimate  the  date  of  occupation  and 
hence  the  probable  date  of  the  remarkable  paintings  in  the  Lascaux  Cave. 

8.  In  the  1950  excavation  at  Nippur,  a  city  of  Babylonia,  charcoal  from  a  roof 
beam  gave  a  count  of  4.09  disintegrations  per  minute  per  gram.  Living  wood 
gave  6.68  disintegrations.  Assuming  that  this  charcoal  was  formed  during  the 
time  of  Hammurabi’s  reign,  find  an  estimate  for  the  likely  time  of  Hamurabi’s 
succession. 


*  Since  the  mid  1950*s  the  testing  of  nuclear  weapons  has  significantly  increased  the  amount  of 
radioactive  carbon  in  our  atmosphere.  Ironically  this  unfortunate  state  of  affairs  provides  us 
with  yet  another  extremely  powerful  method  of  detecting  art  forgeries.  To  wit,  many  artists’ 
materials,  such  as  linseed  oil  and  canvas  paper,  come  from  plants  and  animals,  and  so  will 
contain  the  same  concentration  of  carbon- 14  as  the  atmosphere  at  the  time  the  plant  or 
animal  dies.  Thus  linseed  oil  (which  is  derived  from  the  flax  plant)  that  was  produced  during 
the  last  few  years  will  contain  a  much  greater  concentration  of  carbon- 14  than  linseed  oil  pro¬ 
duced  before  1950. 
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1.4  Separable  equations 


We  solved  the  first-order  linear  homogeneous  equation 


dt 


+  a(t)y  =  0 


by  dividing  both  sides  of  the  equation  by  y(t)  to  obtain  the  equivalent 
equation 


1  w  <t\ 

hT)-*~  -  () 


and  observing  that  Equation  (2)  can  be  written  in  the  form 

j\n\y{t)\=-a{t).  (3) 

We  then  found  ln|y(f)|,  and  consequently  y(f),  by  integrating  both  sides  of 
(3).  In  an  exactly  analogous  manner,  we  can  solve  the  more  general  dif¬ 
ferential  equation 

dy  _  g(0 

dt  f{y) 


where  /  and  g  are  continuous  functions  of  y  and  t.  This  equation,  and  any 
other  equation  which  can  be  put  into  this  form,  is  said  to  be  separable.  To 
solve  (4),  we  first  multiply  both  sides  by  f(y)  to  obtain  the  equivalent 
equation 

ZO)f  =  *(')•  (5) 


Then,  we  observe  that  (5)  can  be  written  in  the  form 


ftF(y(t))  =  g(t) 


where  F(y)  is  any  anti-derivative  of  /(y); 
quently, 


i.e.,  F(y)  =  jf(y)dy.  Conse- 


F(y(0)  =  j  g(t)dt  +  c  (7) 


where  c  is  an  arbitrary  constant  of  integration,  and  we  solve  fory=y(7) 
from  (7)  to  find  the  general  solution  of  (4). 


Example  1.  Find  the  general  solution  of  the  equation  dy / dt—t2 /y2. 
Solution.  Multiplying  both  sides  of  this  equation  by  y2  gives 


y 


2 


dy 

dt 


or 


d  y\t) 
dt  3 


Hence,  y3(7)  =  /3  +  c  where  c  is  an  arbitrary  constant,  and  y(/)  =  (73  + c)l//3. 
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Example  2.  Find  the  general  solution  of  the  equation 


Solution.  This  equation  can  be  written  in  the  form 

4-ey0)  =  t  +  t 3 
dt 

and  thus  ey(<t)  —  t2/2  +  /4/4+  c.  Taking  logarithms  of  both  sides  of  this 
equation  gives  y(t)  =  ln(t2/2  +  t4/ 4  +  c). 

In  addition  to  the  differential  equation  (4),  we  will  often  impose  an  ini¬ 
tial  condition  on  y(t)  of  the  form  y(t0)=y0.  The  differential  equation  (4) 
together  with  the  initial  condition y(t^—yQ  is  called  an  initial- value  prob¬ 
lem.  We  can  solve  an  initial-value  problem  two  different  ways.  Either  we 
use  the  initial  condition  y(/0)=y0  to  solve  for  the  constant  c  in  (7),  or  else 
we  integrate  both  sides  or  (6)  between  t0  and  t  to  obtain  that 

F{y{t))-F(y0)=  f  g(s)ds.  (8) 

J‘o 

If  we  now  observe  that 

F(y)-F(y0)=  fy/(r)dr,  (9) 

•'.Vo 

then  we  can  rewrite  (8)  in  the  simpler  form 

fyf(r)dr=  f  g(s)ds.  (10) 

Jy0  Jto 


Example  3.  Find  the  solution  y(t)  of  the  initial-value  problem 

ey~^  -((  +  tz)  =  0,  ^(1)=1. 


Solution.  Method  (i).  From  Example  2,  we  know  that  the  general  solution 
of  this  equation  is  y  =  ln(72/2+  r4/4+  c).  Setting  t=  1  andy=l  gives  1  = 
ln(3/4+  c),  or  c  =  e  —  3/4.  Hence, y(t)  =  ln(e  — 3/4+  t2 /2+ 14/4). 

Method  (ii).  From  (10), 


Consequently, 


e 


y 


and  y(r)  =  ln(e-3/4+  r2/2+ 14/4). 


Example  4.  Solve  the  initial- value  problem  dy  /  dt=  1  +y2,  y(0)  =  0. 
Solution.  Divide  both  sides  of  the  differential  equation  by  1  +y2  to  obtain 
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the  equivalent  equation  1/(1  +y2)dy / dt  —  1.  Then,  from  (10) 


Consequently,  arc  tany  =  t ,  and  y  =  tan  t. 

The  solution  y  =  tan  t  of  the  above  problem  has  the  disturbing  property 
that  it  goes  to  ±  oo  at  /  =  ±77/2.  And  what’s  even  more  disturbing  is  the 
fact  that  there  is  nothing  at  all  in  this  initial-value  problem  which  even 
hints  to  us  that  there  is  any  trouble  at  t=  ±  77/2.  The  sad  fact  of  life  is  that 
solutions  of  perfectly  nice  differential  equations  can  go  to  infinity  in  finite 
time.  Thus,  solutions  will  usually  exist  only  on  a  finite  open  interval  a<t< 
b ,  rather  than  for  all  time.  Moreover,  as  the  following  example  shows,  dif¬ 
ferent  solutions  of  the  same  differential  equation  usually  go  to  infinity  at 
different  times. 


Example  5.  Solve  the  initial- value  problem  dy  /  dt  —  1  +y2,  y(0)  =  1. 
Solution.  From  (10) 


ds. 


Consequently,  arc  tany  —  arc  tan  1  =  t ,  and  y  =  tan(/  +  77 /4).  This  solution 
exists  on  the  open  interval  —  377/4  <  t  <  77 /4. 


Example  6.  Find  the  solution  y(t)  of  the  initial-value  problem 

dt  '  v*  '  '  7 . ’  ^°)=1- 

Solution.  Dividing  both  sides  of  the  differential  equation  by  1  +y2  gives 


y~37+0  +^2)sin?  =  0, 


y  dy 


1  4*  y  2  dt 


~  -  sin  t. 


Consequently, 


so  that 


fy  rdr  j 

I  - -  =  I  —  sm  sds, 

J,  1  +  r2  Jo 


|ln(l  +y2)  —  ^ln2  =  cos  /  —  1. 


Solving  this  equation  for  y(/)  gives 

y(t)=  ±(2e_4sin2'/2—  1)1/2. 

To  determine  whether  we  take  the  plus  or  minus  branch  of  the  square  root, 
we  note  thaty(O)  is  positive.  Hence, 

y(t)  =  (2e~4sin2'/2  —  l)l/2 
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This  solution  is  only  defined  when 

2e-4sinV2>  j 


or 


e4sin2'/2<  2.  (11) 

Since  the  logarithm  function  is  monotonic  increasing,  we  may  take  loga¬ 
rithms  of  both  sides  of  (1 1)  and  still  preserve  the  inequality.  Thus,  4sin2//2 
<  In  2,  which  implies  that 


t_ 

2 


<  arc  sin 


Vln2 

2 


Therefore,  y(t)  only  exists  on  the  open  interval  (  —  a, a)  where 

a  =  2arcsin[  Vln2  /2]. 

Now,  this  appears  to  be  a  new  difficulty  associated  with  nonlinear  equa¬ 
tions,  since  y(t)  just  “disappears”  at  t  =  ±  a,  without  going  to  infinity. 
However,  this  apparent  difficulty  can  be  explained  quite  easily,  and  more¬ 
over,  can  even  be  anticipated,  if  we  rewrite  the  differential  equation  above 
in  the  standard  form 

cfy  _  (l+.y2)sinr 
dt  y 

Notice  that  this  differential  equation  is  not  defined  when  >>  =  ().  Therefore, 
if  a  solution  y(t)  achieves  the  value  zero  at  some  time  /=/*,  then  we 
cannot  expect  it  to  be  defined  for  t  >  t*.  This  is  exactly  what  happens  here, 
since  j>(±  a)  =  0. 


Example  7.  Solve  the  initial-value  problem  dy /dt  =  (\  +y)t,  y( 0)=  —  1. 
Solution.  In  this  case,  we  cannot  divide  both  sides  of  the  differential  equa¬ 
tion  by  1  +y ,  since y( 0)=  —  1.  However,  it  is  easily  seen  that y(t)  =  —  1  is 
one  solution  of  this  initial-value  problem,  and  in  Section  1.10  we  show  that 
it  is  the  only  solution.  More  generally,  consider  the  initial-value  problem 
dy / dt  =  f(y) g(t),  y(t0)  =  y0,  where  /(y0)  =  0.  Certainly,  y(t)  =  y0  is  one 
solution  of  this  initial-value  problem,  and  in  Section  1.10  we  show  that  it  is 
the  only  solution  if  3//  dy  exists  and  is  continuous. 

Example  8.  Solve  the  initial-value  problem 

(1  ey)dy / dt  =  cos  /,  y(7r/2)  =  3. 

Solution.  From  (10), 


+  e 


cos  sds 


so  that  y  +  ey  =  2  +  e3  +  sin  /.  This  equation  cannot  be  solved  explicitly  for  y 
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as  a  function  of  t.  Indeed,  most  separable  equations  cannot  be  solved  ex¬ 
plicitly  for/  as  a  function  of  t.  Thus,  when  we  say  that 

/  +  ey  =  2  +  e3  +  sin  t 

is  the  solution  of  this  initial-value  problem,  we  really  mean  that  it  is  an  im¬ 
plicit,  rather  than  an  explicit  solution.  This  does  not  present  us  with  any 
difficulties  in  applications,  since  we  can  always  find/(/)  numerically  with 
the  aid  of  a  digital  computer  (see  Section  1.11). 

Example  9.  Find  all  solutions  of  the  differential  equation  dy  /  dt=  —  t/y. 
Solution.  Multiplying  both  sides  of  the  differential  equation  by  y  gives 
ydy/dt=  —  t.  Hence 


yi+t2  =  c2.  (12) 

Now,  the  curves  (12)  are  closed ,  and  we  cannot  solve  for  /  as  a  single-val¬ 
ued  function  of  t.  The  reason  for  this  difficulty,  of  course,  is  that  the  dif¬ 
ferential  equation  is  not  defined  when/  =  0.  Nevertheless,  the  circles  t2-\-y 2 
=  c2  are  perfectly  well  defined,  even  when  /  =  0.  Thus,  we  will  call  the 
circles  t2+y2=c2  solution  curves  of  the  differential  equation 

dy  /  dt  —  —  t/y. 

More  generally,  we  will  say  that  any  curve  defined  by  (7)  is  a  solution 
curve  of  (4). 


Exercises 

In  each  of  Problems  1-5,  find  the  general  solution  of  the  given  differential 
equation. 


,2s 


1.  (1 4* r)-r-  =  1  +/ .  Hint:  tan(x+/)  = 


dt 


tanx  +  tan/ 

1  -  tan  x  tan y  ' 


dy 


2.  ^-=(1  +  0(1+^) 

4.  ^-~e‘+y+3 
dt 


3. 


dy 

dt 


=  1  -t+y2-ty: 
dy 


5.  cos/  sin  t  =  sin /  cos  t 


In  each  of  Problems  6-12,  solve  the  given  initial- value  problem,  and  de¬ 
termine  the  interval  of  existence  of  each  solution. 


dy 

dt 


6.  t2(l+y2)  +  2y^-=0,  >(0)-l 


7. 


dy_ 

dt 


It 


y+yf 


y(  2) =3 
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8.  (l  +  /2)1/2^-  =  (y3(l  +  <2)-1/2,  >> (0)  =  1 


9. 


dy  3r2  +  4<  +  2 


dt 


2(^-1) 


7(0)= -1 


dy 

dt 


-  tsiny  / 1  \  ^ 

10.  cos7^7=  ^  — ,  y(\)  =  ir/2 


1  +  r 


dy 


11.  -j-  =  k(a  —y){b  —y),  7(0)  =  0,  a,b>0 


dy 

12.  3t-^  —ycost,  7(1)  =  0 

13.  Any  differential  equation  of  the  form  dy /  dt=f(y)  is  separable.  Thus,  we  can 
solve  all  those  first-order  differential  equations  in  which  time  does  not  appear 
explicitly.  Now,  suppose  we  have  a  differential  equation  of  the  form  dy/dt  = 
f(y/t),  such  as,  for  example,  the  equation  dy / dt  =  sin(y / 1).  Differential  equa¬ 
tions  of  this  form  are  called  homogeneous  equations.  Since  the  right-hand  side 
only  depends  on  the  single  variable  y  / 1,  it  suggests  itself  to  make  the  substitu¬ 
tion  7 /  t  =  v  or  7  =  tv. 

(a)  Show  that  this  substitution  replaces  the  equation  dy / dt=  f(y / 1)  by  the 
equivalent  equation  tdv/dt  +  u=/(u),  which  is  separable. 

(b)  Find  the  general  solution  of  the  equation  dy  /  dt  =  2(y  / 1)  +  (y  / 1)2. 

14.  Determine  whether  each  of  the  following  functions  of  t  and  y  can  be  expressed 
as  a  function  of  the  single  variable  y / 1. 

y2  +  2ty 


(a) 


(c) 


(e) 


r 

>>3+ t3 

t2+y3 

„l+y 


(b) 


y3+,3 

yt2+y3 


t  +7 

(d)  \ny  —  \nt  +  — 


y 


,*-y 


.  t+y 
(g)  sin  — 


(f)  In VT+7  —In V/— 7 

(t2  +  lty  +  972)1/2 


y 


00 


3  /  +  57 


15.  Solve  the  initial-value  problem  t{dy  /  dt)  =  y  4-  \r+y2  ,  7(1)  =  0. 
Find  the  general  solution  of  the  following  differential  equations. 


16.  2ty^-  =3y2-  c2 


17.  (t-VJy)^=y 


18. 


dy_  =  t+y 
dt  t  —7 


dy 


19.  et/y(y-t)-^+y(\  +  et/y)  =  0 


Hint 


v  —  1 


ve  l/v  +  v2 


dv  —  ln(l  +  vel/v) 
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20.  Consider  the  differential  equation 

dy  _  t+y  +  1 

dt  t-y  +  3  '  1  1 

We  could  solve  this  equation  if  the  constants  1  and  3  were  not  present.  To 
eliminate  these  constants,  we  make  the  substitution  t  =  T+  h,  y  =  Y+k. 

(a)  Determine  h  and  k  so  that  (*)  can  be  written  in  the  form  dY / dT — 
(7+  Y)/(T  —  Y). 

(b)  Find  the  general  solution  of  (*).  (See  Exercise  18). 

21.  (a)  Prove  that  the  differential  equation 

dy  at  +  by  +  m 
dt  ct  +  dy  +  n 

where  a,  b ,  c,  d ,  m ,  and  n  are  constants,  can  always  be  reduced  to  dy  /  dt  — 
(at  +  by) / {ct  +  dy)  if  ad—bc=£ 0. 

(b)  Solve  the  above  equation  in  the  special  case  that  ad  =  be. 

Find  the  general  solution  of  the  following  equations. 

22.  (1  +  t-2y)  +  (4t  —  3y  —  6)dy/dt  =  0 

23.  (t  +  2y  +  2>)  +  (2t  +  4y  -  \  ) dy / dt  =  0 


1.5  Population  models 

In  this  section  we  will  study  first-order  differential  equations  which  govern 
the  growth  of  various  species.  At  first  glance  it  would  seem  impossible  to 
model  the  growth  of  a  species  by  a  differential  equation  since  the  popula¬ 
tion  of  any  species  always  changes  by  integer  amounts.  Hence  the  popula¬ 
tion  of  any  species  can  never  be  a  differentiable  function  of  time.  How¬ 
ever,  if  a  given  population  is  very  large  and  it  is  suddenly  increased  by  one, 
then  the  change  is  very  small  compared  to  the  given  population.  Thus,  we 
make  the  approximation  that  large  populations  change  continuously  and 
even  differentiably  with  time. 

Let p(t)  denote  the  population  of  a  given  species  at  time  t  and  let  r(t,p) 
denote  the  difference  between  its  birth  rate  and  its  death  rate.  If  this 
population  is  isolated,  that  is,  there  is  no  net  immigration  or  emigration, 
then  dp / dt,  the  rate  of  change  of  the  population,  equals  rp(t).  In  the  most 
simplistic  model  we  assume  that  r  is  constant,  that  is,  it  does  not  change 
with  either  time  or  population.  Then,  we  can  write  down  the  following  dif¬ 
ferential  equation  governing  population  growth: 


, 


a  =  constant. 
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This  is  a  linear  equation  and  is  known  as  the  Malthusian  law  of  population 
growth.  If  the  population  of  the  given  species  is  p0  at  time  t0 ,  then  p{t) 
satisfies  the  initial-value  problem  dp(t)/ dt  =  ap(t),  p(t0)=p0.  The  solution 
of  this  initial- value  problem  is p(t)=p0ea^~to\  Hence  any  species  satisfying 
the  Malthusian  law  of  population  growth  grows  exponentially  with  time. 

Now,  we  have  just  formulated  a  very  simple  model  for  population 
growth;  so  simple,  in  fact,  that  we  have  been  able  to  solve  it  completely  in  a 
few  lines.  It  is  important,  therefore,  to  see  if  this  model,  with  its  simplicity, 
has  any  relationship  at  all  with  reality.  Let  p(t)  denote  the  human  popula¬ 
tion  of  the  earth  at  time  t.  It  was  estimated  that  the  earth’s  human 
population  was  increasing  at  an  average  rate  of  2%  per  year  during  the 
period  1960-1970.  Let  us  start  in  the  middle  of  this  decade  on  January  1, 
1965,  at  which  time  the  U.S.  Department  of  Commerce  estimated  the  earth’s 
population  to  be  3.34  billion  people.  Then,  t0  =  1965,  /?0  =  3.34X  109  and 
a  —  .02,  so  that 

p(t)  =  (3.34)10VO2('_1965). 

One  way  of  checking  the  accuracy  of  this  formula  is  to  compute  the  time 
required  for  the  population  of  the  earth  to  double,  and  then  compare  it  to 
the  observed  value  of  35  years.  Our  formula  predicts  that  the  population  of 
the  earth  will  double  every  T  years,  where 

e02r=2. 

Taking  logarithms  of  both  sides  of  this  equation  gives  .0277  =  ln2,  so  that 

T—  50 In 2  ^  34.6  years. 

This  is  in  excellent  agreement  with  the  observed  value.  On  the  other  hand, 
though,  let  us  look  into  the  distant  future.  Our  equation  predicts  that  the 
earth’s  population  will  be  200,000  billion  in  the  year  2515,  1,800,000  billion 
in  the  year  2625,  and  3,600,000  billion  in  the  year  2660.  These  are  astro¬ 
nomical  numbers  whose  significance  is  difficult  to  gauge.  The  total  surface 
of  this  planet  is  approximately  1,860,000  billion  square  feet.  Eighty  percent 
of  this  surface  is  covered  by  water.  Assuming  that  we  are  willing  to  live  on 
boats  as  well  as  land,  it  is  easy  to  see  that  by  the  year  2515  there  will  be  only 
9.3  square  feet  per  person;  by  2625  each  person  will  have  only  one  square 
foot  on  which  to  stand;  and  by  2660  we  will  be  standing  two  deep  on  each 
other’s  shoulders. 

It  would  seem  therefore,  that  this  model  is  unreasonable  and  should  be 
thrown  out.  However,  we  cannot  ignore  the  fact  that  it  offers  exceptional 
agreement  in  the  past.  Moreover,  we  have  additional  evidence  that  popula¬ 
tions  do  grow  exponentially.  Consider  the  Microtus  Arvallis  Pall,  a  small 
rodent  which  reproduces  very  rapidly.  We  take  the  unit  of  time  to  be  a 
month,  and  assume  that  the  population  is  increasing  at  the  rate  of  40%  per 
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month.  If  there  are  two  rodents  present  initially  at  time  /  =  0,  then p(t ),  the 
number  of  rodents  at  time  t,  satisfies  the  initial-value  problem 

dp{t)/ dt  =  0Ap,  /?(0)  =  2. 

Consequently, 


p(t)  =  2e0A‘ .  (1) 

Table  1  compares  the  observed  population  with  the  population  calculated 
from  Equation  (1). 


Table  1.  The  growth  of  Microtus  Arvallis  Pall. 


Months 

0 

2 

6 

10 

p  Observed 

2 

5 

20 

109 

p  Calculated 

2 

4.5 

22 

109.1 

As  one  can  see,  there  is  excellent  agreement. 

Remark.  In  the  case  of  the  Microtus  Arvallis  Pall,  p  observed  is  very  ac¬ 
curate  since  the  pregnancy  period  is  three  weeks  and  the  time  required  for 
the  census  taking  is  considerably  less.  If  the  pregnancy  period  were  very 
short  then  p  observed  could  not  be  accurate  since  many  of  the  pregnant  ro¬ 
dents  would  have  given  birth  before  the  census  was  completed. 

The  way  out  of  our  dilemma  is  to  observe  that  linear  models  for  popula¬ 
tion  growth  are  satisfactory  as  long  as  the  population  is  not  too  large. 
When  the  population  gets  extremely  large  though,  these  models  cannot  be 
very  accurate,  since  they  do  not  reflect  the  fact  that  individual  members 
are  now  competing  with  each  other  for  the  limited  living  space,  natural  re¬ 
sources  and  food  available.  Thus,  we  must  add  a  competition  term  to  our 
linear  differential  equation.  A  suitable  choice  of  a  competition  term  is 
—  bp2,  where  b  is  a  constant,  since  the  statistical  average  of  the  number  of 
encounters  of  two  members  per  unit  time  is  proportional  to  p2.  We  con¬ 
sider,  therefore,  the  modified  equation 


dp 

dt 


=  ap  —  bp2. 


This  equation  is  known  as  the  logistic  law  of  population  growth  and  the 
numbers  a,b  are  called  the  vital  coefficients  of  the  population.  It  was  first 
introduced  in  1837  by  the  Dutch  mathematical-biologist  Verhulst.  Now, 
the  constant  b ,  in  general,  will  be  very  small  compared  to  a,  so  that  if  p  is 
not  too  large  then  the  term  —  bp2  will  be  negligible  compared  to  ap  and  the 
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population  will  grow  exponentially.  However,  when  p  is  very  large,  the 
term  —  bp2  is  no  longer  negligible,  and  thus  serves  to  slow  down  the  rapid 
rate  of  increase  of  the  population.  Needless  to  say,  the  more  industrialized 
a  nation  is,  the  more  living  space  it  has,  and  the  more  food  it  has,  the 
smaller  the  coefficient  b  is. 

Let  us  now  use  the  logistic  equation  to  predict  the  future  growth  of  an 
isolated  population.  If  p0  is  the  population  at  time  t0 ,  then  p(t ),  the  popula¬ 
tion  at  time  /,  satisfies  the  initial-value  problem 

dp  ~ 

-  =  ap-bpz,  p(t0)=p0. 


This  is  a  separable  differential  equation,  and  from  Equation  (10),  Section 
1.4, 

p  dr 


r 

JPo 


ar  —  br 2 


=  f  ds=  t  —  t0. 

Jto 


To  integrate  the  function  1  /(ar—br2)  we  resort  to  partial  fractions.  Let 

1  _  1  -A\  B 

ar-br2  r(a-br)  r  a-br' 


To  find  A  and  B ,  observe  that 


A 

r 


+ 


B 

a  — br 


A  (a  —  br)  +  Br 
r(a  —  br) 


Aa  +  (B  —  bA)r 
r(a  —  br) 


Therefore,  Aa  +  (B  —  bA)r  =  1.  Since  this  equation  is  true  for  all  values  of  r, 
we  see  that  Aa  =  1  and  B-bA—  0.  Consequently,  A  =  \/ a,  B  =  b/a,  and 


fV-- . 1  ni + -L-)* 

r(a-br)  a  )  \  r  a-br ) 

Po  v  '  J Po 


In—  +ln 

a  — bp0 

=  —In— 

ci-bp0 

a 

Po 

a  — bp 

a  Po 

a  — bp 

Thus, 


a(t— 


P_ 

Po 


a-  bPo 

a  — bp 


Now,  it  is  a  simple  matter  to  show  (see  Exercise  1)  that 

a-t>Po 
a  —  bp(t) 


is  always  positive.  Hence, 


a  ~  bp0 
a  —  bp 
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Taking  exponentials  of  both  sides  of  this  equation  gives 

p  a-bp0 


a(t-t0)  _ 


or 


Po  a~bP  ’ 
p0(a-bp)eaU~'<>)  =  (a-bp0)p. 


Bringing  all  terms  involving  p  to  the  left-hand  side  of  this  equation,  we  see 
that 


—  bp0+  bp0ea^  p(t)  =  ap0ea(<t  to). 


Consequently, 


p(  o  = 


ap0e 


a(t~  t0 ) 


<*P  o 


a  —  bp0+  bp0ea^  bp0  +  (a-  bp0)e  a /o) 


(3) 


Let  us  now  examine  Equation  (3)  to  see  what  kind  of  population  it  pre¬ 
dicts.  Observe  that  as  t-+  oo, 


MO 


aPo  =  a 
bPo  b  ’ 


Thus,  regardless  of  its  initial  value ,  the  population  always  approaches  the 
limiting  value  a/b.  Next,  observe  that  p(t)  is  a  monotonically  increasing 
function  of  time  if  0 <p0<a/b.  Moreover,  since 


d2p  dp  dp 

-2bPTt={a-lbP)P(“-bp), 


we  see  that  dp /dt  is  increasing  if  p(t)<a/2b,  and  that  dp / dt  is  decreasing 
if  p(t)>  a /lb.  Hence,  if  p0<a/2b,  the  graph  of  p(t)  must  have  the  form 
given  in  Figure  1.  Such  a  curve  is  called  a  logistic,  or  S-shaped  curve. 
From  its  shape  we  conclude  that  the  time  period  before  the  population 
reaches  half  its  limiting  value  is  a  period  of  accelerated  growth.  After  this 
point,  the  rate  of  growth  decreases  and  in  time  reaches  zero.  This  is  a 
period  of  diminishing  growth. 

These  predictions  are  borne  out  by  an  experiment  on  the  protozoa 
Paramecium  caudatum  performed  by  the  mathematical  biologist  G.  F. 
Gause.  Five  individuals  of  Paramecium  were  placed  in  a  small  test  tube 
containing  0.5  cm3  of  a  nutritive  medium,  and  for  six  days  the  number  of 
individuals  in  every  tube  was  counted  daily.  The  Paramecium  were  found 
to  increase  at  a  rate  of  230.9%  per  day  when  their  numbers  were  low.  The 
number  of  individuals  increased  rapidly  at  first,  and  then  more  slowly,  un¬ 
til  towards  the  fourth  day  it  attained  a  maximum  level  of  375,  saturating 
the  test  tube.  From  this  data  we  conclude  that  if  the  Paramecium  cauda¬ 
tum  grow  according  to  the  logistic  law  dp/ dt  =  ap  —  bp 2,  then  a  =  2.309  and 
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Figure  1.  Graph  of  p(t) 


b  =  2.309/375.  Consequently,  the  logistic  law  predicts  that 


p(  o= 


(2.309)5 


(2.309)5 
375 

375 

1  +  74e-2'309' 


(2.309)5  \  ,  0An 

+  12.309-  3?5  jg-2-309' 


(We  have  taken  the  initial  time  t0  to  be  0.)  Figure  2  compares  the  graph  of 
p{t)  predicted  by  Equation  (4)  with  the  actual  measurements,  which  are  de¬ 
noted  by  o.  As  can  be  seen,  the  agreement  is  remarkably  good. 

In  order  to  apply  our  results  to  predict  the  future  human  population  of 
the  earth,  we  must  estimate  the  vital  coefficients  a  and  b  in  the  logistic 
equation  governing  its  growth.  Some  ecologists  have  estimated  that  the  na¬ 
tural  value  of  a  is  0.029.  We  also  know  that  the  human  population  was  in¬ 
creasing  at  the  rate  of  2%  per  year  when  the  population  was  (3.34)  109. 
Since  (1  /p)(dp/dt)  =  a  -  bp,  we  see  that 

0.02  =  a-b  (3.34)1 09. 

Consequently,  b  —  2.695  X  10” 12 .  Thus,  according  to  the  logistic  law  of 
population  growth,  the  human  population  of  the  earth  will  tend  to  the 
limiting  value  of 

t-  — - -"‘0^ — —  =  10.76  billion  people 

b  2.695X10“ 12 

Note  that  according  to  this  prediction,  we  were  still  on  the  accelerated 
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growth  portion  of  the  logistic  curve  in  1965,  since  we  had  not  yet  attained 
half  the  limiting  population  predicted  for  us. 

Remark.  A  student  of  mine  once  suggested  that  we  use  Equation  (3)  to  find 
the  time  t  at  which p(t)  =  2,  and  then  we  can  deduce  how  long  ago  mankind 
appeared  on  earth.  On  the  surface  this  seems  like  a  fantastic  idea.  However, 
we  cannot  travel  that  far  backwards  into  the  past,  since  our  model  is  no 
longer  accurate  when  the  population  is  small. 

As  another  verification  of  the  validity  of  the  logistic  law  of  population 
growth,  we  consider  the  equation 


197,273,000 

1  +  g  ~  0  03 1 34(  /  —  19 1 3.25) 


which  was  introduced  by  Pearl  and  Reed  as  a  model  of  the  population 
growth  of  the  United  States.  First,  using  the  census  figures  for  the  years 
1790,  1850,  and  1910,  Pearl  and  Reed  found  from  (3)  (see  Exercise  2a)  that 
a  =  0.03134  and  Z?  =  (1.5887)10~10.  Then  (see  Exercise  2b),  Pearl  and  Reed 
calculated  that  the  population  of  the  United  States  reached  half  its  limiting 
population  of  a/b  =  197,273,000  in  April  1913.  Consequently  (see  Exercise 
2c),  we  can  rewrite  (3)  in  the  simpler  form  (5). 

Table  2  below  compares  Pearl  and  Reed’s  predictions  with  the  observed 
values  of  the  population  of  the  United  States.  These  results  are  remarkable, 
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Table  2.  Population  of  the  U.S.  from  1790-1950.  (The  last  4  entries 
were  added  by  the  Dartmouth  College  Writing  Group.) 


Actual 

Predicted 

Error 

% 

1790 

3,929,000 

3,929,000 

0 

0.0 

1800 

5,308,000 

5,336,000 

28,000 

0.5 

1810 

7,240,000 

7,228,000 

-  12,000 

-0.2 

1820 

9,638,000 

9,757,000 

119,000 

1.2 

1830 

12,866,000 

13,109,000 

243,000 

1.9 

1840 

17,069,000 

17,506,000 

437,000 

2.6 

1850 

23,192,000 

23,192,000 

0 

0.0 

1860 

31,443,000 

30,412,000 

-1,031,000 

-3.3 

1870 

38,558,000 

39,372,000 

814,000 

2.1 

1880 

50,156,000 

50,177,000 

21,000 

0.0 

1890 

62,948,000 

62,769,000 

- 179,000 

-0.3 

1900 

75,995,000 

76,870,000 

875,000 

1.2 

1910 

91,972,000 

91,972,000 

0 

0.0 

1920 

105,711,000 

107,559,000 

1,848,000 

1.7 

1930 

122,775,000 

123,124,000 

349,000 

0.3 

1940 

131,669,000 

136,653,000 

4,984,000 

3.8 

1950 

150,697,000 

149,053,000 

-  1,644,000 

-1.1 

especially  since  we  have  not  taken  into  account  the  large  waves  of  im¬ 
migration  into  the  United  States,  and  the  fact  that  the  United  States  was 
involved  in  five  wars  during  this  period. 

In  1845  Verhulst  prophesied  a  maximum  population  for  Belgium  of 
6,600,000,  and  a  maximum  population  for  France  of  40,000,000.  Now,  the 
population  of  Belgium  in  1930  was  already  8,092,000.  This  large  dis¬ 
crepancy  would  seem  to  indicate  that  the  logistic  law  of  population  growth 
is  very  inaccurate,  at  least  as  far  as  the  population  of  Belgium  is  con¬ 
cerned.  However,  this  discrepancy  can  be  explained  by  the  astonishing  rise 
of  industry  in  Belgium,  and  by  the  acquisition  of  the  Congo  which  secured 
for  the  country  sufficient  additional  wealth  to  support  the  extra  popula¬ 
tion.  Thus,  after  the  acquisition  of  the  Congo,  and  the  astonishing  rise  of 
industry  in  Belgium,  Verhulst  should  have  lowered  the  vital  coefficient  b. 

On  the  other  hand,  the  population  of  France  in  1930  was  in  remarkable 
agreement  with  Verhulst’s  forecast.  Indeed,  we  can  now  answer  the  follow¬ 
ing  tantalizing  paradox:  Why  was  the  population  of  France  increasing 
extremely  slowly  in  1930  while  the  French  population  of  Canada  was  in¬ 
creasing  very  rapidly?  After  all,  they  are  the  same  people!  The  answer  to 
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this  paradox,  of  course,  is  that  the  population  of  France  in  1930  was  very 
near  its  limiting  value  and  thus  was  far  into  the  period  of  diminishing 
growth,  while  the  population  of  Canada  in  1930  was  still  in  the  period  of 
accelerated  growth. 

Remark  1.  It  is  clear  that  technological  developments,  pollution  considera¬ 
tions  and  sociological  trends  have  significant  influence  on  the  vital  coef¬ 
ficients  a  and  b.  Therefore,  they  must  be  re-evaluated  every  few  years. 

Remark  2.  To  derive  more  accurate  models  of  population  growth,  we 
should  not  consider  the  population  as  made  up  of  one  homogeneous  group 
of  individuals.  Rather,  we  should  subdivide  it  into  different  age  groups.  We 
should  also  subdivide  the  population  into  males  and  females,  since  the  re¬ 
production  rate  in  a  population  usually  depends  more  on  the  number  of 
females  than  on  the  number  of  males. 


Remark  3.  Perhaps  the  severest  criticism  leveled  at  the  logistic  law  of 
population  growth  is  that  some  populations  have  been  observed  to 
fluctuate  periodically  between  two  values,  and  any  type  of  fluctuation  is 
ruled  out  in  a  logistic  curve.  However,  some  of  these  fluctuations  can  be 
explained  by  the  fact  that  when  certain  populations  reach  a  sufficiently 
high  density,  they  become  susceptible  to  epidemics.  The  epidemic  brings 
the  population  down  to  a  lower  value  where  it  again  begins  to  increase, 
until  when  it  is  large  enough,  the  epidemic  strikes  again.  In  Exercise  10  we 
derive  a  model  to  describe  this  phenomenon,  and  we  apply  this  model  in 
Exercise  1 1  to  explain  the  sudden  appearance  and  disappearance  of  hordes 
of  small  rodents. 


Epilog.  The  following  article  appeared  in  the  New  York  Times  on  March 
26,  1978,  and  was  authored  by  Nick  Eberstadt. 

The  gist  of  the  following  article  is  that  it  is  very  difficult,  using  statistical 
methods  alone,  to  make  accurate  population  projections  even  30  years  into 
the  future.  In  1970,  demographers  at  the  United  Nations  projected  the 
population  of  the  earth  to  be  6.5  billion  people  by  the  year  2000.  Only  six 
years  later,  this  forecast  was  revised  downward  to  5.9  billion  people. 

Let  us  now  use  Equation  (3)  to  predict  the  population  of  the  earth  in  the 
year  2000.  Setting  a  =  .029,  b  =  2.695  X  10“ 12,/?0  =  3.34X  109,  t0  =  1965,  and 
t  —  2,000  gives 


/>(2000)  = 


(.029)(3.34)109 
.009  +  (.02)  029)35 

29(3.34)  9 

9+20<T1015 


=  5.96  billion  people! 


This  is  another  spectacular  application  of  the  logistic  equation. 
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Exercises 

1.  Prove  that  (a  —  bpQ)/(a  -  bp(t ))  is  positive  for  t0  <  t  <  oo.  Hint :  Use  Equation  (2) 
to  show  that  p(t)  can  never  equal  a/b  if  p0=£  a/ b. 

2.  (a)  Choose  3  times  t0 ,  tx,  and  t2,  with  tx  — 10=  t2—  tx.  Show  that  (3)  determines  a 

and  b  uniquely  in  terms  of  t0,p(t0),  tx,p(tx),  t2,  and 

(b)  Show  that  the  period  of  accelerated  growth  for  the  United  States  ended  in 
April,  1913. 

(c)  Let  a  population  p(t)  grow  according  to  the  logistic  law  (3),  and  let  t  be  the 
time  at  which  half  the  limiting  population  is  achieved.  Show  that 


a/b 

1 +£>"*('■') 


3.  In  1879  and  1881  a  number  of  yearling  bass  were  seined  in  New  Jersey,  taken 
across  the  continent  in  tanks  by  train,  and  planted  in  San  Francisco  Bay.  A  total 
of  only  435  Striped  Bass  survived  the  rigors  of  these  two  trips.  Yet,  in  1899,  the 
commercial  net  catch  alone  was  1,234,000  pounds.  Since  the  growth  of  this 
population  was  so  fast,  it  is  reasonable  to  assume  that  it  obeyed  the  Malthusian 
law  dp/ dt  =  ap.  Assuming  that  the  average  weight  of  a  bass  fish  is  three  pounds, 
and  that  in  1899  every  tenth  bass  fish  was  caught,  find  a  lower  bound  for  a. 

4.  Suppose  that  a  population  doubles  its  original  size  in  100  years,  and  triples  it  in 
200  years.  Show  that  this  population  cannot  satisfy  the  Malthusian  law  of 
population  growth. 

5.  Assume  that  p{t)  satisfies  the  Malthusian  law  of  population  growth.  Show  that 
the  increases  in  p  in  successive  time  intervals  of  equal  duration  form  the  terms  of 
a  geometric  progression.  This  is  the  source  of  Thomas  Malthus’  famous  dictum 
“Population  when  unchecked  increases  in  a  geometrical  ratio.  Subsistence 
increases  only  in  an  arithmetic  ratio.  A  slight  acquaintance  with  numbers  will 
show  the  immensity  of  the  first  power  in  comparison  of  the  second.” 

6.  A  population  grows  according  to  the  logistic  law,  with  a  limiting  population  of 
5  X  105 6 7 8  individuals.  When  the  population  is  low  it  doubles  every  40  minutes. 
What  will  the  population  be  after  two  hours  if  initially  it  is  (a)  108,  (b)  109? 

7.  A  family  of  salmon  fish  living  off  the  Alaskan  Coast  obeys  the  Malthusian  law 
of  population  growth  dp(t)/dt  =  0.003p(t),  where  t  is  measured  in  minutes.  At 
time  /  =  0  a  group  of  sharks  establishes  residence  in  these  waters  and  begins 
attacking  the  salmon.  The  rate  at  which  salmon  are  killed  by  the  sharks  is 

0.00  \p 2(t),  where  p(t)  is  the  population  of  salmon  at  time  t.  Moreover,  since  an 
undesirable  element  has  moved  into  their  neighborhood,  0.002  salmon  per 
minute  leave  the  Alaskan  waters. 

(a)  Modify  the  Malthusian  law  of  population  growth  to  take  these  two  factors 
into  account. 
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(b)  Assume  that  at  time  /  =  0  there  are  one  million  salmon.  Find  the  population 
p(t).  What  happens  as  /->oo? 

(c)  Show  that  the  above  model  is  really  absurd.  Hint:  Show,  according  to  this 
model,  that  the  salmon  population  decreases  from  one  million  to  about  one 
thousand  in  one  minute. 

8.  The  population  of  New  York  City  would  satisfy  the  logistic  law 

=  J_  1  2 

dt  25  p  (25)  106  P 

where  /  is  measured  in  years,  if  we  neglected  the  high  emigration  and  homicide 
rates. 

(a)  Modify  this  equation  to  take  into  account  the  fact  that  9,000  people  per  year 
move  from  the  city,  and  1,000  people  per  year  are  murdered. 

(b)  Assume  that  the  population  of  New  York  City  was  8,000,000  in  1970.  Find 
the  population  for  all  future  time.  What  happens  as  /— >  oo? 

9.  An  initial  population  of  50,000  inhabits  a  microcosm  with  a  carrying  capacity  of 
100,000.  After  five  years,  the  population  has  increased  to  60,000.  Show  that  the 
natural  growth  rate  a  for  this  population  is  (l/5)ln3/2. 

10.  We  can  model  a  population  which  becomes  susceptible  to  epidemics  in  the 
following  manner.  Assume  that  our  population  is  originally  governed  by  the  lo¬ 
gistic  law 

dP  ,  2  r\ 

~dt  ~aP~  bp2  (0 

and  that  an  epidemic  strikes  as  soon  as  p  reaches  a  certain  value  Q,  with  Q  less 
than  the  limiting  population  a/b.  At  this  stage  the  vital  coefficients  become 
A  <a,  B  <b,  and  Equation  (i)  is  replaced  by 

^ =Ap-  Bp2.  (ii) 

Suppose  that  Q  >  A/B.  The  population  then  starts  decreasing.  A  point  is 
reached  when  the  population  falls  below  a  certain  value  q>A/B.  At  this  mo¬ 
ment  the  epidemic  ceases  and  the  population  again  begins  to  grow  following 
Equation  (i),  until  the  incidence  of  a  fresh  epidemic.  In  this  way  there  are  peri¬ 
odic  fluctuations  of  p  between  q  and  Q.  We  now  indicate  how  to  calculate  the 
period  T  of  these  fluctuations. 

(a)  Show  that  the  time  Tx  taken  by  the  first  part  of  the  cycle,  when  p  increases 
from  q  to  Q  is  given  by 

r  1.  Q(a-bq) 

1  a  n q(a-bQ) 

(b)  Show  that  the  time  T2  taken  by  the  second  part  of  the  cycle,  when  p  de¬ 
creases  from  Q  to  q  is  given  by 

r2=l \nqi"fB  A)  . 

2  A  Q(qB-A) 

Thus,  the  time  for  the  entire  cycle  is  T\  +  T2. 
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11.  It  has  been  observed  that  plagues  appear  in  mice  populations  whenever  the 
population  becomes  too  large.  Further,  a  local  increase  of  density  attracts  preda¬ 
tors  in  large  numbers.  These  two  factors  will  succeed  in  destroying  97-98%  of  a 
population  of  small  rodents  in  two  or  three  weeks,  and  the  density  then  falls  to  a 
level  at  which  the  disease  cannot  spread.  The  population,  reduced  to  2%  of  its 
maximum,  finds  its  refuges  from  the  predators  sufficient,  and  its  food  abundant. 
The  population  therefore  begins  to  grow  again  until  it  reaches  a  level  favorable 
to  another  wave  of  disease  and  predation.  Now,  the  speed  of  reproduction  in 
mice  is  so  great  that  we  may  set  b  =  0  in  Equation  (i)  of  Exercise  7.  In  the  second 
part  of  the  cycle,  on  the  contrary,  A  is  very  small  in  comparison  with  B ,  and  it 
may  be  neglected  therefore  in  Equation  (ii). 

(a)  Under  these  assumptions,  show  that 


and 


Q~q 

qQB  * 


(b)  Assuming  that  Tx  is  approximately  four  years,  and  Q/q  is  approximately 
fifty,  show  that  a  is  approximately  one.  This  value  of  a,  incidentally,  corre¬ 
sponds  very  well  with  the  rate  of  multiplication  of  mice  in  natural  circums¬ 
tances. 


12.  There  are  many  important  classes  of  organisms  whose  birth  rate  is  not  propor¬ 
tional  to  the  population  size.  Suppose,  for  example,  that  each  member  of  the 
population  requires  a  partner  for  reproduction,  and  that  each  member  relies  on 
chance  encounters  for  meeting  a  mate.  If  the  expected  number  of  encounters  is 
proportional  to  the  product  of  the  numbers  of  males  and  females,  and  if  these 
are  equally  distributed  in  the  population,  then  the  number  of  encounters,  and 
hence  the  birthrate  too,  is  proportional  to  p2.  The  death  rate  is  still  proportional 
to  p.  Consequently,  the  population  size  p(t)  satisfies  the  differential  equation 

dp  0 

=  bp—  ap,  a,b>0. 

Show  that  p{t)  approaches  0  as  t-*oo  if  p0<  a/b.  Thus,  once  the  population  size 
drops  below  the  critical  size  a/b ,  the  population  tends  to  extinction.  Thus,  a 
species  is  classified  endangered  if  its  current  size  is  perilously  close  to  its  critical 
size. 


1.6  The  spread  of  technological  innovations 

Economists  and  sociologists  have  long  been  concerned  with  how  a  techno¬ 
logical  change,  or  innovation,  spreads  in  an  industry.  Once  an  innovation 
is  introduced  by  one  firm,  how  soon  do  others  in  the  industry  come  to 
adopt  it,  and  what  factors  determine  how  rapidly  they  follow?  In  this  sec¬ 
tion  we  construct  a  model  of  the  spread  of  innovations  among  farmers, 
and  then  show  that  this  same  model  also  describes  the  spread  of  innova¬ 
tions  in  such  diverse  industries  as  bituminous  coal,  iron  and  steel,  brewing, 
and  railroads. 

Assume  that  a  new  innovation  is  introduced  into  a  fixed  community  of 
N  farmers  at  time  t  =  0.  Let  p(t)  denote  the  number  of  farmers  who  have 
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adopted  at  time  t.  As  in  the  previous  section,  we  make  the  approximation 
that p(t)  is  a  continuous  function  of  time,  even  though  it  obviously  changes 
by  integer  amounts.  The  simplest  realistic  assumption  that  we  can  make 
concerning  the  spread  of  this  innovation  is  that  a  farmer  adopts  the  in¬ 
novation  only  after  he  has  been  told  of  it  by  a  farmer  who  has  already 
adopted.  Then,  the  number  of  farmers  A/7  who  adopt  the  innovation  in  a 
small  time  interval  A  Ms  directly  proportional  to  the  number  of  farmers  p 
who  have  already  adopted,  and  the  number  of  farmers  N—p  who  are  as 
yet  unaware.  Hence,  kp  =  cp(N -p)kt  or  A/?/ &t  =  cp(N —p)  for  some  posi¬ 
tive  constant  c.  Letting  A/— >0,  we  obtain  the  differential  equation 

%  =  cP(N-p).  (1) 


This  is  the  logistic  equation  of  the  previous  section  if  we  set  a  =  cN ,  b  =  c. 
Assuming  that  /?(0)=  1;  i.e.,  one  farmer  has  adopted  the  innovation  at  time 
/  =  0,  we  see  that  p(t)  satisfies  the  initial-value  problem 


dp 

dt 

The  solution  of  (2)  is 


cp{N-p ),  p  (0) =  1 . 


NecNt 

N- \  +  ecNt 


(2) 

(3) 


which  is  a  logistic  function  (see  Section  1.5).  Hence,  our  model  predicts 
that  the  adoption  process  accelerates  up  to  that  point  at  which  half  the 
community  is  aware  of  the  innovation.  After  this  point,  the  adoption  pro¬ 
cess  begins  to  decelerate  until  it  eventually  reaches  zero. 

Let  us  compare  the  predictions  of  our  model  with  data  on  the  spread  of 
two  innovations  through  American  farming  communities  in  the  middle 
1950’s.  Figure  1  represents  the  cumulative  number  of  farmers  in  Iowa 
during  1944-1955  who  adopted  2,4-D  weed  spray,  and  Figure  2  represents 
the  cumulative  percentage  of  corn  acreage  in  hybrid  corn  in  three  Ameri¬ 
can  states  during  the  years  1934-1958.  The  circles  in  these  figures  are  the 
actual  measurements,  and  the  graphs  were  obtained  by  connecting  these 
measurements  with  straight  lines.  As  can  be  seen,  these  curves  have  all  the 
properties  of  logistic  curves,  and  on  the  whole,  offer  very  good  agreement 
with  our  model.  However,  there  are  two  discrepancies.  First,  the  actual 
point  at  which  the  adoption  process  ceases  to  accelerate  is  not  always  when 
fifty  per  cent  of  the  population  has  adopted  the  innovation.  As  can  be  seen 
from  Figure  2,  the  adoption  process  for  hybrid  corn  began  to  decelerate  in 
Alabama  only  after  nearly  sixty  per  cent  of  the  farmers  had  adopted  the 
innovation.  Second,  the  agreement  with  our  model  is  much  better  in  the 
later  stages  of  the  adoption  process  than  in  the  earlier  stages. 

The  source  of  the  second  discrepancy  is  our  assumption  that  a  farmer 
only  learns  of  an  innovation  through  contact  with  another  farmer.  This  is 
not  entirely  true.  Studies  have  shown  that  mass  communication  media  such 
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as  radio,  television,  newspapers  and  farmers’  magazines  play  a  large  role  in 
the  early  stages  of  the  adoption  process.  Therefore,  we  must  add  a  term  to 
the  differential  equation  (1)  to  take  this  into  account.  To  compute  this 
term,  we  assume  that  the  number  of  farmers  Ap  who  learn  of  the  innova¬ 
tion  through  the  mass  communication  media  in  a  short  period  of  time  A/  is 
proportional  to  the  number  of  farmers  who  do  not  yet  know;  i.e., 

kp  =  c'(N-p)kt 

for  some  positive  constant  cf.  Letting  Af-^0,  we  see  that  c'(N—p)  farmers, 
per  unit  time,  learn  of  the  innovation  through  the  mass  communication 
media.  Thus,  if  p(0)  =  0,  then  p(t)  satisfies  the  initial- value  problem 

^=cp(N-p)  +  c'(N-p),  p(  0)=0.  (4) 

The  solution  of  (4)  is 

Nc'[e(c'+cN)‘-\] 
cN+c'e(c'+cN)t  ’ 

and  in  Exercises  2  and  3  we  indicate  how  to  determine  the  shape  of  the 
curve  (5). 

The  corrected  curve  (5)  now  gives  remarkably  good  agreement  with  Fig¬ 
ures  1  and  2,  for  suitable  choices  of  c  and  c'.  However,  (see  Exercise  3c)  it 
still  doesn’t  explain  why  the  adoption  of  hybrid  corn  in  Alabama  only  be¬ 
gan  to  decelerate  after  sixty  per  cent  of  the  farmers  had  adopted  the  in¬ 
novation.  This  indicates,  of  course,  that  other  factors,  such  as  the  time  in¬ 
terval  that  elapses  between  when  a  farmer  first  learns  of  an  innovation  and 
when  he  actually  adopts  it,  may  play  an  important  role  in  the  adoption 
process,  and  must  be  taken  into  account  in  any  model. 

We  would  now  like  to  show  that  the  differential  equation 

dp/ dt  =  cp  (N—p) 

also  governs  the  rate  at  which  firms  in  such  diverse  industries  as  bi¬ 
tuminous  coal,  iron  and  steel,  brewing,  and  railroads  adopted  several 
major  innovations  in  the  first  part  of  this  century.  This  is  rather  surprising, 
since  we  would  expect  that  the  number  of  firms  adopting  an  innovation  in 
one  of  these  industries  certainly  depends  on  the  profitability  of  the  innova¬ 
tion  and  the  investment  required  to  implement  it,  and  we  haven’t  men¬ 
tioned  these  factors  in  deriving  Equation  (1).  However,  as  we  shall  see 
shortly,  these  two  factors  are  incorporated  in  the  constant  c. 

Let  n  be  the  total  number  of  firms  in  a  particular  industry  who  have 
adopted  an  innovation  at  time  /.  It  is  clear  that  the  number  of  firms  A p 
who  adopt  the  innovation  in  a  short  time  interval  A t  is  proportional  to  the 
number  of  firms  n  —p  who  have  not  yet  adopted;  i.e.,  A p  —  X(n  —p)kt.  Let¬ 
ting  A/— >0,  we  see  that 
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dp 

—  =X(n  —p). 
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The  proportionality  factor  A  depends  on  the  profitability  tt  of  installing  this 
innovation  relative  to  that  of  alternative  investments,  the  investment  s  re¬ 
quired  to  install  this  innovation  as  a  percentage  of  the  total  assets  of  the 
firm,  and  the  percentage  of  firms  who  have  already  adopted.  Thus, 


\=f(7r,s,p/n). 

Expanding  /  in  a  Taylor  series,  and  dropping  terms  of  degree  more  than 
two,  gives 

P  ?  7 

A  =  ax  +  a27r  +  a3s  +  aA  — h  a5  m  +  a6s  +  a77rs 


n 


+ 


In  the  late  1950’s,  Edwin  Mansfield  of  Carnegie  Mellon  University  in¬ 
vestigated  the  spread  of  twelve  innovations  in  four  major  industries.  From 
his  exhaustive  studies,  Mansfield  concluded  that  a10  =  0  and 

ax  -I-  a27T  +  a3s  +  a57r2  4-  a6s2  4-  a77rs  =  0. 

Thus,  setting 

k  =  a4+  4-  (6) 

we  see  that 

dP  V  P  (  ^ 

—  =  k  —  (n-p). 
dt  n  v 

(This  is  the  equation  obtained  previously  for  the  spread  of  innovations 
among  farmers,  if  we  set  k/n  =  c.)  We  assume  that  the  innovation  is  first 
adopted  by  one  firm  in  the  year  tQ .  Then,  p{t)  satisfies  the  initial-value 
problem 

dp  =-.P(n~P)-  p(t  o)=1  (7) 


and  this  implies  that 


dt  n 


p(‘)= 


n 


1  +  (n  —  l)e 


Mansfield  studied  how  rapidly  the  use  of  twelve  innovations  spread 
from  enterprise  to  enterprise  in  four  major  industries — bituminous  coal, 
iron  and  steel,  brewing,  and  railroads.  The  innovations  are  the  shuttle  car, 
trackless  mobile  loader,  and  continuous  mining  machine  (in  bituminous 
coal);  the  by-product  coke  oven,  continuous  wide  strip  mill,  and  continu¬ 
ous  annealing  line  for  tin  plate  (in  iron  and  steel);  the  pallet-loading 
machine,  tin  container,  and  high  speed  bottle  filler  (in  brewing);  and  the 
diesel  locomotive,  centralized  traffic  control,  and  car  retarders  (in  rail¬ 
roads).  His  results  are  described  graphically  in  Figure  3.  For  all  but  the 
by-product  coke  oven  and  tin  container,  the  percentages  given  are  for  ev¬ 
ery  two  years  from  the  year  of  initial  introduction.  The  length  of  the  inter¬ 
val  for  the  by-product  coke  oven  is  about  six  years,  and  for  the  tin  con¬ 
tainer,  it  is  six  months.  Notice  how  all  these  curves  have  the  general  ap¬ 
pearance  of  a  logistic  curve. 
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(C) 

Figure  3.  Growth  in  the  percentage  of  major  firms  that  introduced  twelve  in¬ 
novations;  bituminous  coal,  iron  and  steel,  brewing,  and  railroad  industries, 
1890-1958;  (a)  By-product  coke  oven  (CO),  diesel  locomotive  (DL),  tin  container 
(TC),  and  shuttle  car  (SC);  (b)  Car  retarder  (CR),  trackless  mobile  loader  (ML), 
continuous  mining  machine  (CM),  and  pallet-loading  machine  (PL);  (c)  Continu¬ 
ous  wide-strip  mill  (SM),  centralized  traffic  control  (CTC),  continuous  annealing 
(CA),  and  highspeed  bottle  filler  (BF). 
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Table  1. 


Innovation 

n 

*0 

a4 

^8 

a9 

7 T 

s 

Diesel  locomotive 
Centralized  traffic 

25 

1925 

-0.59 

0.530 

-0.027 

1.59 

0.015 

control 

24  1 

1926 

-0.59 

0.530 

-0.027 

1.48 

0.024 

Car  retarders 

25 

1924 

-0.59 

0.530 

-  0.027 

1.25 

0.785 

Continuous  wide 
strip  mill 

By-product  coke 

12 

1924 

-0.52 

! 

0.530 

-  0.027 

1.87 

4.908 

oven 

12 

1894 

-0.52 

0.530 

-  0.027 

1.47 

2.083 

Continuous  annealing 

9 

1936 

-0.52 

0.530 

-  0.027 

1.25 

0.554 

Shuttle  car 

Trackless  mobile 

15 

1937 

-0.57 

0.530 

-  0.027 

1.74 

0.013 

loader 

Continuous  mining 

15 

1934 

-0.57 

0.530 

-  0.027 

1.65 

0.019 

machine 

17 

1947 

-0.57 

0.530 

-  0.027 

2.00 

0.301 

Tin  container 

High  speed  bottle 

22 

1935 

-0.29 

0.530 

-  0.027 

5.07 

0.267 

filler 

Pallet-loading 

16 

1951 

-0.29 

0.530 

-  0.027 

1.20 

0.575 

i 

machine 

19 

1948 

-0.29 

0.530 

-  0.027 

1.67 

0.115 

For  a  more  detailed  comparison  of  the  predictions  of  our  model  (7)  with 
these  observed  results,  we  must  evaluate  the  constants  n,  k ,  and  t0  for  each 
of  the  twelve  innovations.  Table  1  gives  the  value  of  n ,  t0 ,  a4 ,  a5 ,  a9 ,  n r,  and 
,s  for  each  of  the  twelve  innovations;  the  constant  k  can  then  be  computed 
from  Equation  (6).  As  the  answers  to  Exercises  5  and  6  will  indicate,  our 
model  (7)  predicts  with  reasonable  accuracy  the  rate  of  adoption  of  these 
twelve  innovations. 
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Exercises 

1.  Solve  the  initial-value  problem  (2). 

2.  Let  c  =  0  in  (5).  Show  that  p(t)  increases  monotonically  from  0  to  TV,  and  has  no 
points  of  inflection. 

3.  Here  is  a  heuristic  argument  to  determine  the  behavior  of  the  curve  (5).  If  c'  =  0, 
then  we  have  a  logistic  curve,  and  if  c  =  0,  then  we  have  the  behavior  described 
in  Exercise  2.  Thus,  if  c  is  large  relative  to  c',  then  we  have  a  logistic  curve,  and 
if  c  is  small  relative  to  c'  then  we  have  the  behavior  illustrated  in  Exercise  2. 
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(a)  Let  p(t)  satisfy  (4).  Show  that 

d2p 


dv 


=  (N — p)(cp  +  c'){cN  —  2cp  —  c'). 


(b)  Show  that  p(t)  has  a  point  of  inflection,  at  which  dp / dt  achieves  a  maxi¬ 
mum,  if,  and  only  if,  c'/c<N. 

(c)  Assume  that  p(t)  has  a  point  of  inflection  at  t  =  t*.  Show  that  p(t*)  <  N / 2. 

4.  Solve  the  initial-value  problem  (7). 


5.  It  seems  reasonable  to  take  the  time  span  between  the  date  when  20%  of  the 
firms  had  introduced  the  innovation  and  the  date  when  80%  of  the  firms  had  in¬ 
troduced  the  innovation,  as  the  rate  of  imitation. 

(a)  Show  from  our  model  that  this  time  span  is  4(ln2)//:. 

(b)  For  each  of  the  twelve  innovations,  compute  this  time  span  from  the  data  in 
Table  1,  and  compare  with  the  observed  value  in  Figure  3. 

6.  (a)  Show  from  our  model  that  (\/k)\n(n—\)  years  elapse  before  50%  of  the 

firms  introduce  an  innovation. 

(b)  Compute  this  time  span  for  each  of  the  12  innovations  and  compare  with  the 
observed  values  in  Figure  3. 


1.7  An  atomic  waste  disposal  problem 

For  several  years  the  Atomic  Energy  Commission  (now  known  as  the 
Nuclear  Regulatory  Commission)  had  disposed  of  concentrated  radioac¬ 
tive  waste  material  by  placing  it  in  tightly  sealed  drums  which  were  then 
dumped  at  sea  in  fifty  fathoms  (300  feet)  of  water.  When  concerned  ecolo¬ 
gists  and  scientists  questioned  this  practice,  they  were  assured  by  the 
A.E.C.  that  the  drums  would  never  develop  leaks.  Exhaustive  tests  on  the 
drums  proved  the  A.E.C.  right.  However,  several  engineers  then  raised  the 
question  of  whether  the  drums  could  crack  from  the  impact  of  hitting  the 
ocean  floor.  “Never,”  said  the  A.E.C.  “We’ll  see  about  that,”  said  the  en¬ 
gineers.  After  performing  numerous  experiments,  the  engineers  found  that 
the  drums  could  crack  on  impact  if  their  velocity  exceeded  forty  feet  per 
second.  The  problem  before  us,  therefore,  is  to  compute  the  velocity  of  the 
drums  upon  impact  with  the  ocean  floor.  To  this  end,  we  digress  briefly  to 
study  elementary  Newtonian  mechanics. 

Newtonian  mechanics  is  the  study  of  Newton’s  famous  laws  of  motion 
and  their  consequences.  Newton’s  first  law  of  motion  states  that  an  object 
will  remain  at  rest,  or  move  with  constant  velocity,  if  no  force  is  acting  on 
it.  A  force  should  be  thought  of  as  a  push  or  pull.  This  push  or  pull  can  be 
exerted  directly  by  something  in  contact  with  the  object,  or  it  can  be  ex¬ 
erted  indirectly,  as  the  earth’s  pull  of  gravity  is. 

Newton’s  second  law  of  motion  is  concerned  with  describing  the  motion 
of  an  object  which  is  acted  upon  by  several  forces.  Let  y(t)  denote  the 
position  of  the  center  of  gravity  of  the  object.  (We  assume  that  the  object 
moves  in  only  one  direction.)  Those  forces  acting  on  the  object,  which  tend 
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to  increase  y ,  are  considered  positive,  while  those  forces  tending  to  de¬ 
crease  y  are  considered  negative.  The  resultant  force  F  acting  on  an  object 
is  defined  to  be  the  sum  of  all  positive  forces  minus  the  sum  of  all  negative 
forces.  Newton’s  second  law  of  motion  states  that  the  acceleration  d2y  /  dt1 
of  an  object  is  proportional  to  the  resultant  force  F  acting  on  it;  i.e., 


The  constant  m  is  the  mass  of  the  object.  It  is  related  to  the  weight  W  of 
the  object  by  the  relation  W=mg ,  where  g  is  the  acceleration  of  gravity. 
Unless  otherwise  stated,  we  assume  that  the  weight  of  an  object  and  the  ac¬ 
celeration  of  gravity  are  constant.  We  will  also  adopt  the  English  system  of 
units,  so  that  t  is  measured  in  seconds,  y  is  measured  in  feet,  and  F  is 
measured  in  pounds.  The  units  of  m  are  then  slugs,  and  the  gravitational 
acceleration  g  equals  32.2  ft/s2. 


Remark.  We  would  prefer  to  use  the  mks  system  of  units,  where  y  is 
measured  in  meters  and  F  is  measured  in  newtons.  The  units  of  m  are  then 
kilograms,  and  the  gravitational  acceleration  equals  9.8  m/s2.  In  the  third 
edition  of  this  text,  we  have  changed  from  the  English  system  of  units  to  the 
mks  system  in  Section  2.6.  However,  changing  to  the  mks  system  in  this 
section  would  have  caused  undue  confusion  to  the  users  of  the  first  and 
second  editions.  This  is  because  of  the  truncation  error  involved  in  convert¬ 
ing  from  feet  to  meters  and  pounds  to  newtons. 


We  return  now  to  our  atomic  waste  disposal  problem.  As  a  drum  de¬ 
scends  through  the  water,  it  is  acted  upon  by  three  forces  W ,  B ,  and  D. 
The  force  W  is  the  weight  of  the  drum  pulling  it  down,  and  in  magnitude, 
W  =  527.436  lb.  The  force  B  is  the  buoyancy  force  of  the  water  acting  on 
the  drum.  This  force  pushes  the  drum  up,  and  its  magnitude  is  the  weight 
of  the  water  displaced  by  the  drum.  Now,  the  Atomic  Energy  Commission 
used  55  gallon  drums,  whose  volume  is  7.35  ft3.  The  weight  of  one  cubic 
foot  of  salt  water  is  63.99  lb.  Hence  B  =  (63.99)(7.35)  =  470.327  lb. 

The  force  D  is  the  drag  force  of  the  water  acting  on  the  drum;  it  resists 
the  motion  of  the  drum  through  the  water.  Experiments  have  shown  that 
any  medium  such  as  water,  oil,  and  air  resists  the  motion  of  an  object 
through  it.  This  resisting  force  acts  in  the  direction  opposite  the  motion, 
and  is  usually  directly  proportional  to  the  velocity  V  of  the  object.  Thus, 
Z)  =  cK,  for  some  positive  constant  c.  Notice  that  the  drag  force  increases 
as  V  increases,  and  decreases  as  V  decreases.  To  calculate  Z),  the  engineers 
conducted  numerous  towing  experiments.  They  concluded  that  the  orienta¬ 
tion  of  the  drum  had  little  effect  on  the  drag  force,  and  that 


Z)  =  0.08  V 


(lb)(s) 

ft 
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Now,  sety  =  0  at  sea  level,  and  let  the  direction  of  increasing  y  be  down¬ 
wards.  Then,  W  is  a  positive  force,  and  B  and  D  are  negative  forces.  Con¬ 
sequently,  from  (1), 


=  —  (W-B-cV)=4}(W-B-cV). 
m  W 


We  can  rewrite  this  equation  as  a  first-order  linear  differential  equation  for 
V=dy/dt\  i.e., 


dV 

dt  W 


W-B 


)• 


Initially,  when  the  drum  is  released  in  the  ocean,  its  velocity  is  zero.  Thus, 
V  (t),  the  velocity  of  the  drum,  satisfies  the  initial-value  problem 

W  +  W  v=W(w~B)’  K(0)=0’  (3) 

and  this  implies  that 


V(t)=  —  — B  [l-e(~cg/w)']. 


Equation  (4)  expresses  the  velocity  of  the  drum  as  a  function  of  time. 
In  order  to  determine  the  impact  velocity  of  the  drum,  we  must  compute 
the  time  t  at  which  the  drum  hits  the  ocean  floor.  Unfortunately,  though,  it 
is  impossible  to  find  t  as  an  explicit  function  of  y  (see  Exercise  2).  There¬ 
fore,  we  cannot  use  Equation  (4)  to  find  the  velocity  of  the  drum  when  it 
hits  the  ocean  floor.  However,  the  A.E.C.  can  use  this  equation  to  try  and 
prove  that  the  drums  do  not  crack  on  impact.  To  wit,  observe  from  (4)  that 
V  (0  is  a  monotonic  increasing  function  of  time  which  approaches  the 
limiting  value 

V T  c 

as  /  approaches  infinity.  The  quantity  VT  is  called  the  terminal  velocity  of 
the  drum.  Clearly,  V  ( t )  <  VT ,  so  that  the  velocity  of  the  drum  when  it  hits 
the  ocean  floor  is  certainly  less  than  ( W—B)/c .  Now,  if  this  terminal 
velocity  is  less  than  40  ft/s,  then  the  drums  could  not  possibly  break  on 
impact.  However, 


W-B 

c 


527.436-470.327 

0.08 


=  713.86  ft/s, 


and  this  is  way  too  large. 

It  should  be  clear  now  that  the  only  way  we  can  resolve  the  dispute  be¬ 
tween  the  A.E.C.  and  the  engineers  is  to  find  v(y\  the  velocity  of  the  drum 
as  a  function  of  position.  The  function  v(y)  is  very  different  from  the  func¬ 
tion  V (t)9  which  is  the  velocity  of  the  drum  as  a  function  of  time.  How¬ 
ever,  these  two  functions  are  related  through  the  equation 

V(t)  =  v(y(t)) 
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if  we  express  y  as  a  function  of  t.  By  the  chain  rule  of  differentiation, 
dV /  dt  —  ( dv /  dy)(dy  /  dt).  Hence 


Wdvdy_ 

g  dy  dt 


W-B-cV. 


But  dy /dt  —  V(t)  =  v(y(t)).  Thus,  suppressing  the  dependence  of  y  on  t9  we 
see  that  v(y)  satisfies  the  first-order  differential  equation 


Moreover, 

Hence, 

Now, 


v  B  cv9  or 

g  dy 


v  dv 
W—B  —  cv  dy 


v(0)  =  v(y(0))=V(0)  =  0. 
r  dr  _  Cy  g  ,  _  gy 

W-B-cr  ~  J0  W  W  ‘ 


g_ 

W  * 


fv  rdr  rvr  —  (W—B)/c  W_B  r»  dr 

J0  W-B-cr  J  W-B-cr  dr  c  J0  W-B-cr 

1  fv,  .  W-B  fv  dr 

c  J0  r  c  J0  W-B-cr 

v  (W-B)  \W-B-cv 


We  know  already  that  v  <(W—  B)/ c.  Consequently,  W  —  B  —  cv  is  always 
positive,  and 


gy 


V 

c 


( W-B )  W_B 

- - ln- 


CV 


W  c  c  2  W-B  ' 

At  this  point,  we  are  ready  to  scream  in  despair  since  we  cannot  find  v 
as  an  explicit  function  of  y  from  (5).  This  is  not  an  insurmountable  diffi¬ 
culty,  though.  As  we  show  in  Section  1.1 1,  it  is  quite  simple,  with  the  aid  of 
a  digital  computer,  to  find  u(300)  from  (5).  We  need  only  supply  the  com¬ 
puter  with  a  good  approximation  of  d(300)  and  this  is  obtained  in  the 
following  manner.  The  velocity  v(y)  of  the  drum  satisfies  the  initial-value 
problem 

W  dv 

ILv“E  =  W-B-cv  v(0)  =  0.  (6) 

g  dy  x 


Let  us,  for  the  moment,  set  c  =  0  in  (6)  to  obtain  the  new  initial-value  prob¬ 
lem 


—  u^ -  =  W-B,  w(0)  =  0. 

g  dy 


(We  have  replaced  v  by  u  to  avoid  confusion  later.)  We  can  integrate  (6') 
immediately  to  obtain  that 


Wuj_ 

g  2 


=  (W—  B)y9 


or 


u(y)  = 


WK 


-1 1/2 


B)y 
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In  particular, 


w(300)  = 


2  g  1 

1/2 

'  2(32.2)(57.109)(300)  ' 

~~z  ( W —  B  )300 

yy 

= 

527.436 

-.1/2 


=V2092  ^45.7  ft/s. 


We  claim,  now,  that  w(300)  is  a  very  good  approximation  of  u(300).  The 
proof  of  this  is  as  follows.  First,  observe  that  the  velocity  of  the  drum  is 
always  greater  if  there  is  no  drag  force  opposing  the  motion.  Hence, 

t>(300)<  w(300). 

Second,  the  velocity  v  increases  as  y  increases,  so  that  v(y)  <  n(300)  fory  < 
300.  Consequently,  the  drag  force  D  of  the  water  acting  on  the  drum  is  al¬ 
ways  less  than  0.08  X  u (300)  =  3.7  lb.  Now,  the  resultant  force  W—  B  pull¬ 
ing  the  drum  down  is  approximately  57.1  lb,  which  is  very  large  compared 
to  D.  It  stands  to  reason,  therefore,  that  u(y)  should  be  a  very  good  ap¬ 
proximation  of  v(y).  And  indeed,  this  is  the  case,  since  we  find  numeri¬ 
cally  (see  Section  1.11)  that  u(300)  =  45.1  ft/s.  Thus,  the  drums  can  break 
upon  impact,  and  the  engineers  were  right. 

Epilog.  The  rules  of  the  Atomic  Energy  Commission  now  expressly  for¬ 
bid  the  dumping  of  low  level  atomic  waste  at  sea.  This  author  is  uncertain 
though,  as  to  whether  Western  Europe  has  also  forbidden  this  practice. 


Remark.  The  methods  introduced  in  this  section  can  also  be  used  to  find 
the  velocity  of  any  object  which  is  moving  through  a  medium  that  resists 
the  motion.  We  just  disregard  the  buoyancy  force  if  the  medium  is  not 
water.  For  example,  let  V(t )  denote  the  velocity  of  a  parachutist  falling  to 
earth  under  the  influence  of  gravity.  Then, 

W  dV 
g  dt 

where  W  is  the  weight  of  the  man  and  the  parachute,  and  D  is  the  drag 
force  exerted  by  the  atmosphere  on  the  falling  parachutist.  The  drag  force 
on  a  bluff  object  in  air,  or  in  any  fluid  of  small  viscosity  is  usually  very 
nearly  proportional  to  V 2.  Proportionality  to  V  is  the  exceptional  case,  and 
occurs  only  at  very  low  speeds.  The  criterion  as  to  whether  the  square  or 
the  linear  law  applies  is  the  “Reynolds  number” 

R  =  pVL/p. 

L  is  a  representative  length  dimension  of  the  object,  and  p  and  /x  are  the 
density  and  viscosity  of  the  fluid.  If  R<  10,  then  D~  V,  and  if  R  >  103, 
V2.  For  10<R<  103,  neither  law  is  accurate. 
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Exercises 

1.  Solve  the  initial-value  problem  (3). 

2.  Solve  ior  y=y{t)  from  (4),  and  then  show  that  the  equation  y=y(t)  cannot  be 
solved  explicitly  for  t  =  t(y). 

3.  Show  that  the  drums  of  atomic  waste  will  not  crack  upon  impact  if  they  are 
dropped  into  L  feet  of  water  with  (2 g(W—  B)L/  W)l^2< 40. 

4.  Fat  Richie,  an  enormous  underworld  hoodlum  weighing  400  lb,  was  pushed  out 
of  a  penthouse  window  2800  feet  above  the  ground  in  New  York  City.  Neglect¬ 
ing  air  resistance  find  (a)  the  velocity  with  which  Fat  Richie  hit  the  ground;  (b) 
the  time  elapsed  before  Fat  Richie  hit  the  ground. 

5.  An  object  weighing  300  lb  is  dropped  into  a  river  150  feet  deep.  The  volume  of 
the  object  is  2  ft3,  and  the  drag  force  exerted  by  the  water  on  it  is  0.05  times  its 
velocity.  The  drag  force  may  be  considered  negligible  if  it  does  not  exceed  5% 
of  the  resultant  force  pulling  the  drum  down.  Prove  that  the  drag  force  is  negli¬ 
gible  in  this  case.  (Here  5  =  2(62.4)=  124.8.) 

6.  A  400  lb  sphere  of  volume  4tt/3  and  a  300  lb  cylinder  of  volume  m  are  simulta¬ 
neously  released  from  rest  into  a  river.  The  drag  force  exerted  by  the  water  on 
the  falling  sphere  and  cylinder  is  \FS  and  \VC,  respectively,  where  Vs  and  Vc 
are  the  velocities  of  the  sphere  and  cylinder,  and  X  is  a  positive  constant.  De¬ 
termine  which  object  reaches  the  bottom  of  the  river  first. 

7.  A  parachutist  falls  from  rest  toward  earth.  The  combined  weight  of  man  and 
parachute  is  161  lb.  Before  the  parachute  opens,  the  air  resistance  equals  V / 2. 
The  parachute  opens  5  seconds  after  the  fall  begins;  and  the  air  resistance  is 
then  K2/2.  Find  the  velocity  V (t)  of  the  parachutist  after  the  parachute  opens. 

8.  A  man  wearing  a  parachute  jumps  from  a  great  height.  The  combined  weight 
of  man  and  parachute  is  161  lb.  Let  V(t)  denote  his  speed  at  time  t  seconds 
after  the  fall  begins.  During  the  first  10  seconds,  the  air  resistance  is  K/2. 
Thereafter,  while  the  parachute  is  open,  the  air  resistance  is  10K.  Find  an  ex¬ 
plicit  formula  for  V(t)  at  any  time  t  greater  than  10  seconds. 

9.  An  object  of  mass  m  is  projected  vertically  downward  with  initial  velocity  V0  in 
a  medium  offering  resistance  proportional  to  the  square  root  of  the  magnitude 
of  the  velocity. 

(a)  Find  a  relation  between  the  velocity  V  and  the  time  t  if  the  drag  force 
equals  cVK  . 

(b)  Find  the  terminal  velocity  of  the  object.  Hint:  You  can  find  the  terminal 
velocity  even  though  you  cannot  solve  for  V  (/). 

10.  A  body  of  mass  m  falls  from  rest  in  a  medium  offering  resistance  proportional 
to  the  square  of  the  velocity;  that  is,  D  =  cV2.  Find  V(t)  and  compute  the 
terminal  velocity  VT. 

11.  A  body  of  mass  m  is  projected  upward  from  the  earth’s  surface  with  an  initial 
velocity  V0.  Take  the  y-axis  to  be  positive  upward,  with  the  origin  on  the 
surface  of  the  earth.  Assuming  there  is  no  air  resistance,  but  taking  into 
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account  the  variation  of  the  earth’s  gravitational  field  with  altitude,  we  obtain 
that 

dV  rngR 2 

( y  +  R )2 

where  R  is  the  radius  of  the  earth. 

(a)  Let  V(t)  =  v(y(t)).  Find  a  differential  equation  satisfied  by  t>(y). 

(b)  Find  the  smallest  initial  velocity  V0  for  which  the  body  will  not  return  to 
earth.  This  is  the  so-called  escape  velocity.  Hint:  The  escape  velocity  is 
found  by  requiring  that  t>(y)  remain  strictly  positive. 

12.  It  is  not  really  necessary  to  find  t>(y)  explicitly  in  order  to  prove  that  t>(300) 
exceeds  40  ft/s.  Here  is  an  alternate  proof.  Observe  first  that  v(y)  increases  as 
y  increases.  This  implies  that  y  is  a  monotonic  increasing  function  of  v.  There¬ 
fore,  if  y  is  less  than  300  ft  when  v  is  40  ft/s,  then  v  must  be  greater  than  40 
ft/s  wheny  is  300  ft.  Substitute  t>  =  40  ft/s  in  Equation  (5),  and  show  thaty  is 
less  than  300  ft.  Conclude,  therefore,  that  the  drums  can  break  upon  impact. 


1.8  The  dynamics  of  tumor  growth,  mixing  problems 
and  orthogonal  trajectories 

In  this  section  we  present  three  very  simple  but  extremely  useful  applica¬ 
tions  of  first-order  equations.  The  first  application  concerns  the  growth  of 
solid  tumors;  the  second  application  is  concerned  with  “mixing  problems” 
or  “compartment  analysis”;  and  the  third  application  shows  how  to  find  a 
family  of  curves  which  is  orthogonal  to  a  given  family  of  curves. 

(a)  The  dynamics  of  tumor  growth 

It  has  been  observed  experimentally,  that  “free  living”  dividing  cells,  such 
as  bacteria  cells,  grow  at  a  rate  proportional  to  the  volume  of  dividing  cells 
at  that  moment.  Let  V(t)  denote  the  volume  of  dividing  cells  at  time  t. 
Then, 

^  =\V  (1) 

dt  v  ’ 

for  some  positive  constant  A.  The  solution  of  (1)  is 

V(t)=v 0ex('-,")  (2) 

where  V0  is  the  volume  of  dividing  cells  at  the  initial  time  t0.  Thus,  free 
living  dividing  cells  grow  exponentially  with  time.  One  important  con¬ 
sequence  of  (2)  is  that  the  volume  of  cells  keeps  doubling  (see  Exercise  1) 
every  time  interval  of  length  In2/A. 

On  the  other  hand,  solid  tumors  do  not  grow  exponentially  with  time. 
As  the  tumor  becomes  larger,  the  doubling  time  of  the  total  tumor  volume 
continuously  increases.  Various  researchers  have  shown  that  the  data  for 
many  solid  tumors  is  fitted  remarkably  well,  over  almost  a  1000  fold  in- 
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crease  in  tumor  volume,  by  the  equation 

K(0-Koexp(£(l-exp(-a0))  (3) 

where  exp(x)  =  ex,  and  X  and  a  are  positive  constants. 

Equation  (3)  is  usually  referred  to  as  a  Gompertzian  relation.  It  says 
that  the  tumor  grows  more  and  more  slowly  with  the  passage  of  time,  and 
that  it  ultimately  approaches  the  limiting  volume  VQex^a.  Medical  re¬ 
searchers  have  long  been  concerned  with  explaining  this  deviation  from 
simple  exponential  growth.  A  great  deal  of  insight  into  this  problem  can  be 
gained  by  finding  a  differential  equation  satisfied  by  V(t).  Differentiating 
(3)  gives 

=  K0Aexp(  —  at)  exp(^(l  —exp( —  «(/))) j 

=  \e~a,V.  (4) 

Two  conflicting  theories  have  been  advanced  for  the  dynamics  of  tumor 
growth.  They  correspond  to  the  two  arrangements 


dV 


(4a) 

(4b) 


of  the  differential  equation  (4).  According  to  the  first  theory,  the  retarding 
effect  of  tumor  growth  is  due  to  an  increase  in  the  mean  generation  time  of 
the  cells,  without  a  change  in  the  proportion  of  reproducing  cells.  As  time 
goes  on,  the  reproducing  cells  mature,  or  age,  and  thus  divide  more  slowly. 
This  theory  corresponds  to  the  bracketing  (a). 

The  bracketing  (b)  suggests  that  the  mean  generation  time  of  the  divid¬ 
ing  cells  remains  constant,  and  the  retardation  of  growth  is  due  to  a  loss  in 
reproductive  cells  in  the  tumor.  One  possible  explanation  for  this  is  that  a 
necrotic  region  develops  in  the  center  of  the  tumor.  This  necrosis  appears  at 
a  critical  size  for  a  particular  type  of  tumor,  and  thereafter  the  necrotic 
“core”  increases  rapidly  as  the  total  tumor  mass  increases.  According  to 
this  theory,  a  necrotic  core  develops  because  in  many  tumors  the  supply  of 
blood,  and  thus  of  oxygen  and  nutrients,  is  almost  completely  confined  to 
the  surface  of  the  tumor  and  a  short  distance  beneath  it.  As  the  tumor 
grows,  the  supply  of  oxygen  to  the  central  core  by  diffusion  becomes  more 
and  more  difficult  resulting  in  the  formation  of  a  necrotic  core. 


(b)  Mixing  problems 

Many  important  problems  in  biology  and  engineering  can  be  put  into  the 
following  framework.  A  solution  containing  a  fixed  concentration  of  sub¬ 
stance  x  flows  into  a  tank,  or  compartment,  containing  the  substance  x 
and  possibly  other  substances,  at  a  specified  rate.  The  mixture  is  stirred 
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together  very  rapidly,  and  then  leaves  the  tank,  again  at  a  specified  rate. 
Find  the  concentration  of  substance  x  in  the  tank  at  any  time  t. 

Problems  of  this  type  fall  under  the  general  heading  of  “mixing  prob¬ 
lems,”  or  compartment  analysis.  The  following  example  illustrates  how  to 
solve  these  problems. 


Example  1.  A  tank  contains  S0  lb  of  salt  dissolved  in  200  gallons  of  water. 
Starting  at  time  /  =  0,  water  containing  \  lb  of  salt  per  gallon  enters  the 
tank  at  the  rate  of  4  gal/min,  and  the  well  stirred  solution  leaves  the  tank 
at  the  same  rate.  Find  the  concentration  of  salt  in  the  tank  at  any  time  t  > 
0. 

Solution .  Let  S(t)  denote  the  amount  of  salt  in  the  tank  at  time  t.  Then, 
S'(t),  which  is  the  rate  of  change  of  salt  in  the  tank  at  time  t ,  must  equal 
the  rate  at  which  salt  enters  the  tank  minus  the  rate  at  which  it  leaves  the 
tank.  Obviously,  the  rate  at  which  salt  enters  the  tank  is 

\  lb /gal  times  4  gal/min  =  2  lb /min. 


After  a  moment’s  reflection,  it  is  also  obvious  that  the  rate  at  which  salt 
leaves  the  tank  is 


Thus 


4  gal  /  min  times 


S{t) 
200  ' 


S'(t)  =  2- 


S(i) 

50 


S(0)  =  S0, 


and  this  implies  that 

5  ( t)  =  S0e ~  °*02'  +  100(1  —  e~  °02' ) . 
Hence,  the  concentration  c(t)  of  salt  in  the  tank  is  given  by 


c(t)  = 


S(t) 


=  — o  ~ 0-02/  1  ^ |  —  0~ 0.02/ 


200  200 


)• 


(5) 

(6) 


Remark.  The  first  term  on  the  right-hand  side  of  (5)  represents  the  por¬ 
tion  of  the  original  amount  of  salt  remaining  in  the  tank  at  time  t.  This 
term  becomes  smaller  and  smaller  with  the  passage  of  time  as  the  original 
solution  is  drained  from  the  tank.  The  second  term  on  the  right-hand  side 
of  (5)  represents  the  amount  of  salt  in  the  tank  at  time  t  due  to  the  action 
of  the  flow  process.  Clearly,  the  amount  of  salt  in  the  tank  must  ultimately 
approach  the  limiting  value  of  100  lb,  and  this  is  easily  verified  by  letting  t 
approach  oo  in  (5). 

(c)  Orthogonal  trajectories 

In  many  physical  applications,  it  is  often  necessary  to  find  the  orthogonal 
trajectories  of  a  given  family  of  curves.  (A  curve  which  intersects  each 
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member  of  a  family  of  curves  at  right  angles  is  called  an  orthogonal  trajec¬ 
tory  of  the  given  family.)  For  example,  a  charged  particle  moving  under 
the  influence  of  a  magnetic  field  always  travels  on  a  curve  which  is  per¬ 
pendicular  to  each  of  the  magnetic  field  lines.  The  problem  of  computing 
orthogonal  trajectories  of  a  family  of  curves  can  be  solved  in  the  following 
manner.  Let  the  given  family  of  curves  be  described  by  the  relation 


F(x,y,c)  =  0. 


Differentiating  this  equation  yields 


Fx  +  F  y'  =  0, 


Next,  we  solve  for  c  =  c(x,y)  from  (7)  and  replace  every  c  in  (8)  by  this 
value  c(x,y).  Finally,  since  the  slopes  of  curves  which  intersect  orthogo¬ 
nally  are  negative  reciprocals  of  each  other,  we  see  that  the  orthogonal 
trajectories  of  (7)  are  the  solution  curves  of  the  equation 


Example  2.  Find  the  orthogonal  trajectories  of  the  family  of  parabolas 

x  =  cy2. 

Solution.  Differentiating  the  equation  x  —  cy2  gives  1=2 cyy'.  Since  c  = 
x/y 2,  we  see  thsity'=y/2x.  Thus,  the  orthogonal  trajectories  of  the  family 

y 

a 


Figure  1.  The  parabolas  x  =  cy2  and  their  orthogonal  trajectories 
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of  parabolas  x  =  cy2  are  the  solution  curves  of  the  equation 

y=-y.  (10) 

This  equation  is  separable,  and  its  solution  is 

y2  +  2x2  =  k 2.  (11) 

Thus,  the  family  of  ellipses  (11)  (see  Figure  1)  are  the  orthogonal  trajecto¬ 
ries  of  the  family  of  parabolas  x  —  cy 2. 

Reference 

Burton,  Alan  C.,  Rate  of  growth  of  solid  tumors  as  a  problem  of  diffusion,  Growth , 
1966,  vol.  30,  pp.  157-176. 

Exercises 

1.  A  given  substance  satisfies  the  exponential  growth  law  (1).  Show  that  the  graph 
of  In  V  versus  t  is  a  straight  line. 

2.  A  substance  x  multiplies  exponentially,  and  a  given  quantity  of  the  substance 
doubles  every  20  years.  If  we  have  3  lb  of  substance  x  at  the  present  time,  how 
many  lb  will  we  have  7  years  from  now? 

3.  A  substance  x  decays  exponentially,  and  only  half  of  the  given  quantity  of  x 
remains  after  2  years.  How  long  does  it  take  for  5  lb  of  jc  to  decay  to  1  lb? 

4.  The  equation  p'  =  apa ,  a>  1,  is  proposed  as  a  model  of  the  population  growth 
of  a  certain  species.  Show  that  p{t)^>  oo  in  finite  time.  Conclude,  therefore,  that 
this  model  is  not  accurate  over  a  reasonable  length  of  time. 

5.  A  cancerous  tumor  satisfies  the  Gompertzian  relation  (3).  Originally,  when  it 
contained  104  cells,  the  tumor  was  increasing  at  the  rate  of  20%  per  unit  time. 
The  numerical  value  of  the  retarding  constant  a  is  0.02.  What  is  the  limiting 
number  of  cells  in  this  tumor? 

6.  A  tracer  dose  of  radioactive  iodine  131 1  is  injected  into  the  blood  stream  at  time 
t  =  0.  Assume  that  the  original  amount  Q0  of  iodine  is  distributed  evenly  in  the 
entire  blood  stream  before  any  loss  occurs.  Let  Q(t)  denote  the  amount  of 
iodine  in  the  blood  at  time  t  >  0.  Part  of  the  iodine  leaves  the  blood  and  enters 
the  urine  at  the  rate  kxQ,  Another  part  of  the  iodine  enters  the  thyroid  gland  at 
the  rate  k2Q .  Find  Q(t). 

7.  Industrial  waste  is  pumped  into  a  tank  containing  1000  gallons  of  water  at  the 
rate  of  1  gal/min,  and  the  well-stirred  mixture  leaves  the  tank  at  the  same  rate, 
(a)  Find  the  concentration  of  waste  in  the  tank  at  time  t.  (b)  How  long  does  it 
take  for  the  concentration  to  reach  20%? 

8.  A  tank  contains  300  gallons  of  water  and  100  gallons  of  pollutants.  Fresh  water 
is  pumped  into  the  tank  at  the  rate  of  2  gal/min,  and  the  well-stirred  mixture 
leaves  at  the  same  rate.  How  long  does  it  take  for  the  concentration  of  pollu¬ 
tants  in  the  tank  to  decrease  to  1/10  of  its  original  value? 
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9.  Consider  a  tank  containing,  at  time  /  =  0,  Q0  lb  of  salt  dissolved  in  150  gallons 
of  water.  Assume  that  water  containing  \  lb  of  salt  per  gallon  is  entering  the 
tank  at  a  rate  of  3  gal/min,  and  that  the  well-stirred  solution  is  leaving  the  tank 
at  the  same  rate.  Find  an  expression  for  the  concentration  of  salt  in  the  tank  at 
time  t. 

10.  A  room  containing  1000  cubic  feet  of  air  is  originally  free  of  carbon  monoxide. 
Beginning  at  time  /  =  0,  cigarette  smoke  containing  4  percent  carbon  monoxide 
is  blown  into  the  room  at  the  rate  of  0.1  ft3/min,  and  the  well-circulated  mix¬ 
ture  leaves  the  room  at  the  same  rate.  Find  the  time  when  the  concentration  of 
carbon  monoxide  in  the  room  reaches  0.012  percent.  (Extended  exposure  to 
this  concentration  of  carbon  monoxide  is  dangerous.) 

11.  A  500  gallon  tank  originally  contains  100  gallons  of  fresh  water.  Beginning  at 
time  t  =  0,  water  containing  50  percent  pollutants  flows  into  the  tank  at  the  rate 
of  2  gal/min,  and  the  well-stirred  mixture  leaves  at  the  rate  of  1  gal/min.  Find 
the  concentration  of  pollutants  in  the  tank  at  the  moment  it  overflows. 

In  Exercises  12-17,  find  the  orthogonal  trajectories  of  the  given  family  of 
curves. 

12.  y  =  cx2  13.  y2~~  x2  =  c 

14.  y  =  csinx  15.  x2+y2  =  cx  (see  Exercise  13  of  Sec¬ 

tion  1.4) 

16.  y  —  cex  ll.y  =  ecx 

18.  The  presence  of  toxins  in  a  certain  medium  destroys  a  strain  of  bacteria  at  a 
rate  jointly  proportional  to  the  number  of  bacteria  present  and  to  the  amount 
of  toxin.  Call  the  constant  of  proportionality  a.  If  there  were  no  toxins  present, 
the  bacteria  would  grow  at  a  rate  proportional  to  the  amount  present.  Call  this 
constant  of  proportionality  b.  Assume  that  the  amount  T  of  toxin  is  increasing 
at  a  constant  rate  c,  that  is,  dT /  dt  —  c,  and  that  the  production  of  toxins  begins 
at  time  f  =  0.  Lety(/)  denote  the  number  of  living  bacteria  present  at  time  t. 

(a)  Find  a  first-order  differential  equation  satisfied  by  y(0- 

(b)  Solve  this  differential  equation  to  obtain  y(t).  What  happens  to  y(t)  as  /  ap¬ 
proaches  oo? 

19.  Many  savings  banks  now  advertise  continuous  compounding  of  interest.  This 
means  that  the  amount  of  money  P(t)  on  deposit  at  time  /,  satisfies  the  dif¬ 
ferential  equation  dP(t)/dt  =  rP(t)  where  r  is  the  annual  interest  rate  and  /  is 
measured  in  years.  Let  P0  denote  the  original  principal. 

(a)  Show  that  P(\)  =  P0er. 

(b)  Let  r  =  0.0575,  0.065,  0.0675,  and  0.075.  Show  that  er  =  1.05919,  1.06716, 
1.06983,  and  1.07788,  respectively.  Thus,  the  effective  annual  yield  on  inter¬ 
est  rates  of  5|,  6±,  6f ,  and  1\%  should  be  5.919,  6.716,  6.983,  and  7.788%, 
respectively.  Most  banks,  however,  advertise  effective  annual  yields  of  6, 
6.81,  7.08,  and  7.9%,  respectively.  The  reason  for  this  discrepancy  is  that 
banks  calculate  a  daily  rate  of  interest  based  on  360  days,  and  they  pay  in¬ 
terest  for  each  day  money  is  on  deposit.  For  a  year,  one  gets  five  extra 
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days.  Thus,  we  must  multiply  the  annual  yields  of  5.919,  6.716,  6.983,  and 
7.788%  by  365/360,  and  then  we  obtain  the  advertised  values. 

(c)  It  is  interesting  to  note  that  the  Old  Colony  Cooperative  Bank  in  Rhode  Is¬ 
land  advertises  an  effective  annual  yield  of  6.72%  on  an  annual  interest  rate 
of  6^%  (the  lower  value),  and  an  effective  annual  yield  of  7.9%  on  an  ann¬ 
ual  interest  rate  of  7^%.  Thus  they  are  inconsistent. 


1.9  Exact  equations,  and  why  we  cannot  solve 
very  many  differential  equations 

When  we  began  our  study  of  differential  equations,  the  only  equation  we 
could  solve  was  dy / dt  =  g(t).  We  then  enlarged  our  inventory  to  include 
all  linear  and  separable  equations.  More  generally,  we  can  solve  all  dif¬ 
ferential  equations  which  are,  or  can  be  put,  in  the  form 

T, ■*'-?)- o  0) 

for  some  function  </>(/, y).  To  wit,  we  can  integrate  both  sides  of  (1)  to  ob¬ 
tain  that 

<>(/,  y)  =  constant  (2) 

and  then  solve  for  y  as  a  function  of  t  from  (2). 

Example  1.  The  equation  1  +cos(f  +>>)  +  cos(/+  y)(<ty / dt)  =  Q  can  be  writ¬ 
ten  in  the  form  (d/dt)[t  +  sin(t+y)\  =  0.  Hence, 

<l>(t,y)  =  t  +  sin(f+7)  =  c,  and  y=  —  /  +  arcsin(c  —  t). 


Example  2.  The  equation  cos(/+>>)  +  [1  +  cos(t+y)\dy /dt  =  0  can  be  writ¬ 
ten  in  the  form  (d/ di)[y  +  sm(t+y)]  =  0.  Hence, 

<HO0  =y  +  sin(/  +y)  =  c. 

We  must  leave  the  solution  in  this  form  though,  since  we  cannot  solve  for  y 
explicitly  as  a  function  of  time. 


Equation  (1)  is  clearly  the  most  general  first-order  differential  equation 
that  we  can  solve.  Thus,  it  is  important  for  us  to  be  able  to  recognize  when 
a  differential  equation  can  be  put  in  this  form.  This  is  not  as  simple  as  one 
might  expect.  For  example,  it  is  certainly  not  obvious  that  the  differential 
equation 


2t+y  —  sin/  +  (3y2  +  cos>>  + 1)^  =0 


can  be  written  in  the  form  (d / dt)(y3  +  t2  +  ty  +  siny  +  cos t)  =  0.  To  find  all 
those  differential  equations  which  can  be  written  in  the  form  (1),  observe, 
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from  the  chain  rule  of  partial  differentiation,  that 


d<f> 

~di 


d<t>  dy 
dy  dt  * 


Hence,  the  differential  equation  M  (t,y)  +  N  (t,y)(cfy  /  dt)  =  0  can  be  written 
in  the  form  (d/dt)<j>(t9y)  =  0  if  and  only  if  there  exists  a  function  <f>(t,y) 
such  that  M(t,y)  =  d<j>/dt  and  N(t,y)=  d<j>/dy. 

This  now  leads  us  to  the  following  question.  Given  two  functions 
M(t,y)  and  N(t,y ),  does  there  exist  a  function  </>(/, y)  such  that  M(t,y) 
=  d<t>/dt  and  N(t,y)=d<f>/dyl  Unfortunately,  the  answer  to  this  question 
is  almost  always  no  as  the  following  theorem  shows. 


Theorem  1.  Let  M(t,y)  and  N(t,y)  be  continuous  and  have  continuous  par¬ 
tial  derivatives  with  respect  to  t  andy  in  the  rectangle  R  consisting  of  those 
points  ( t,y )  with  a<t<b  and  c<y<d.  There  exists  a  function  <£>(/, y) 
such  that  M(t,y)=  d<j>/d t  and  N(7,y)  =  d<j>/dy  if  and  only  if 

dM  /  dy  =  dN  /dt 

in  R. 


Proof.  Observe  that  M(t,y)=*  d<j>/d t  for  some  function  <l>(t,y)  if,  and  only 

if, 

$(Uy)  —  j  M  (t,y)df  +  h(y)  (3) 

where  h{y)  is  an  arbitrary  function  of  y.  Taking  partial  derivatives  of  both 
sides  of  (3)  with  respect  to  y,  we  obtain  that 


dt  +  h'(y). 


Hence,  d<j>/dy  will  be  equal  to  N(t,y )  if,  and  only  if, 

3  M(t,y) 


N 


/vivj.  yifVi 

- — dt  +  h'(y) 


or 


fdM(t,y) 

h  (y)  =  N(t,y)  -  J - ^ -dt.  (4) 

Now  h\y)  is  a  function  of  y  alone,  while  the  right-hand  side  of  (4)  appears 
to  be  a  function  of  both  t  andy.  But  a  function  ofy  alone  cannot  be  equal 
to  a  function  of  both  /  and  y.  Thus  Equation  (4)  makes  sense  only  if  the 
right-hand  side  is  a  function  of  y  alone,  and  this  is  the  case  if,  and  only  if, 


a_ 

dt 


N(t,y)-f 


9  M(t,y)  ] 
— ; - dt 


3  N  3  M 
3 1  3 
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Hence,  if  dN/dt^  dM/dy,  then  there  is  no  function  <t>(t9y)  such  that  M 
=  d<j>/dt9  N=  3<J>/3 y.  On  the  other  hand,  if  dN/dt  =  dM/dy  then  we  can 
solve  for 


h(y)~  f  N(t>y)-f 


dM(t,y) 

dy 


dt 


Consequently,  M  =  d<j>/dt9  and  N=d<j>/dy  with 


<HO>)  =  J  M(t,y)dt  +  J 


dM  (t,y) 
dy 


dt 


□  (5) 


Definition.  The  differential  equation 


M(t,y)  +  N(t,y)^  =  0 


is  said  to  be  exact  if  dM/dy  =  dN /dt. 

The  reason  for  this  definition,  of  course,  is  that  the  left-hand  side  of  (6) 
is  the  exact  derivative  of  a  known  function  of  t  and y  if  dM/dy  =  dN/dt. 

Remark  1.  It  is  not  essential,  in  the  statement  of  Theorem  1,  that  dM/dy 
=  dN/dt  in  a  rectangle.  It  is  sufficient  if  dM/dy  =  dN/dt  in  any  region  R 
which  contains  no  “holes”.  That  is  to  say,  if  C  is  any  closed  curve  lying  en¬ 
tirely  in  R ,  then  its  interior  also  lies  entirely  in  R. 

Remark  2.  The  differential  equation  dy /dt=f(t9y)  can  always  be  written 
in  the  form  M  (t,y)  + N(t,y)(dy /dt)  =  0  by  setting  M  (t9y)  =  —f(t,y)  and 
#  (/,y)=l. 

Remark  3.  It  is  customary  to  say  that  the  solution  of  an  exact  differential 
equation  is  given  by  <$>{t9 y)  =  constant.  What  we  really  mean  is  that  the 
equation  <$>(t9y)  =  c  is  to  be  solved  fory  as  a  function  of  t  and  c.  Unfor¬ 
tunately,  most  exact  differential  equations  cannot  be  solved  explicitly  fory 
as  a  function  of  t.  While  this  may  appear  to  be  very  disappointing,  we  wish 
to  point  out  that  it  is  quite  simple,  with  the  aid  of  a  computer,  to  compute 
y(t)  to  any  desired  accuracy  (see  Section  1.11). 

In  practice,  we  do  not  recommend  memorizing  Equation  (5).  Rather,  we 
will  follow  one  of  three  different  methods  to  obtain  <j>(t9y). 

First  Method :  The.  equation  M(t,y)=  3 <j>/  dt  determines  </>(/, y)  up  to  an  ar¬ 
bitrary  function  of  y  alone,  that  is, 

<>{uy)  =  ,y)dt  +  h(y). 
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The  function  h(y)  is  then  determined  from  the  equation 

r  3 M(t,y) 

h'(y)  =  N(t,y)~  J - ^ — dt. 

Second  Method :  If  N  (t,y)=  dip/  dy,  then,  of  necessity, 

(N(,  ,y)dy  +  k(t ) 

where  k(t)  is  an  arbitrary  function  of  t  alone.  Since 


3<f>  fdN(t,y) 


3 1 


dy  +  k'(t) 


we  see  that  k(t)  is  determined  from  the  equation 

3  N(t,y) 


k'(t)  =  M  (t,y)  —  J 


3 1 


dy. 


Note  that  the  right-hand  side  of  this  equation  (see  Exercise  2)  is  a  function 
of  t  alone  if  dM/dy  =  dN/dt. 

Third  Method :  The  equations  d<j>/dt  =  M(t,y)  and  9<J>/9y  =  N(t,y)  imply 
that 

<t>(t,y)  =  jM(t,y)dt  +  h(y)  and  <t>(t,y)  =  j N {t,y)dy  +  k(t). 
Usually,  we  can  determine  h(y)  and  k(t)  just  by  inspection. 


Example  3.  Find  the  general  solution  of  the  differential  equation 


3y  +  e*  +  (3t  +  cosy) 


dy 

dt 


Solution.  Here  M (t,y)  =  3y  +  el  and  N(t,y)  =  3t  +  cosy.  This  equation  is 
exact  since  dM/dy  =  3  and  dN/dt  =  3.  Hence,  there  exists  a  function  <t>(tfy) 
such  that 


3y  +  et  = 


d<j> 

~dt 


and 


(ii)  3/  +  cosy  = 


We  will  find  <l>(t,y)  by  each  of  the  three  methods  outlined  above. 


First  Method :  From  (i),  <f>(t,y)  =  et  +  3ty  +  h(y).  Differentiating  this  equa¬ 
tion  with  respect  to  y  and  using  (ii)  we  obtain  that 


h\y)  +  3t  =  3t  +  cosy. 


Thus,  A(y)  =  siny  and  <j>(t,y)  =  ef  +  3ty  +  siny.  (Strictly  speaking,  h(y)=* 
siny  +  constant.  However,  we  already  incorporate  this  constant  of  integra¬ 
tion  into  the  solution  when  we  write  <j>(t,y)  =  c.)  The  general  solution  of  the 
differential  equation  must  be  left  in  the  form  e*  +  3ty  H-siny  =  c  since  we 
cannot  find  y  explicitly  as  a  function  of  t  from  this  equation. 

Second  Method :  From  (ii),  </>(/, y)  =  3 ty  +  siny  +  k(t).  Differentiating  this 
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expression  with  respect  to  t9  and  using  (i)  we  obtain  that 

3y  +  k\t)  =  3y  +  el. 

Thus,  &(/)  =  el  and  <f>(t9y)  =  3ty  +  siny  +  el. 

Third  Method *  From  (i)  and  (ii) 

<l>(t9y)  =  el  +  3ty  4-  h(y)  and  <j>(t, y)  =  3 ty  +  siny  +  k(t). 

Comparing  these  two  expressions  for  the  same  function  <j>(t9y)  it  is  obvious 
that  h(y)  =  smy  and  k^  —  e1.  Hence 

<t>(  t9y )  =  e '  -I-  3  ty  +  siny . 


Example  4.  Find  the  solution  of  the  initial-value  problem 

3 12 y  +  %ty2  +  (t3  +  &t2 y  +  12 y2)  ■—■=(),  y{ 2)=  1. 

Solution.  Here  M (t9y)  =  3t2y  4-  Sty2  and  N(t9y)=  t3  +  St2y  +  12y2.  This 
equation  is  exact  since 

=3/2+  \6ty  and  =3r2+  16(y. 

3y  3/ 

Hence,  there  exists  a  function  <j>(t9y)  such  that 

,2 _  9<> 


(i)  3t  y  +  Sty‘ 


and  (ii)  t3  +  %t2y  +  12y2  = 


3<J> 


3*  -  v“/  -  ■  -  '  ■  dy  • 

Again,  we  will  find  <l>(t9y)  by  each  of  three  methods. 

First  Method :  From  (i),  <f>(t9y)—  t3y  +  4/2y2  +  h(y).  Differentiating  this 
equation  with  respect  to  y  and  using  (ii)  we  obtain  that 

t3  +  8  t2y  +  h\y)  =  t3  +  St2y  +  12 y2. 


Hence,  h(y)  =  4y3  and  the  general  solution  of  the  differential  equation  is 
<l>(t9y)  =t*y  +  4 t2y2  +  4y3  =  c.  Setting  t  =  2  and  y  —  1  in  this  equation,  we  see 
that  c  =  28.  Thus,  the  solution  of  our  initial-value  problem  is  defined  im¬ 
plicitly  by  the  equation  /^y  +  4/2y2  +  4y3  =  28. 

Second  Method :  From  (ii),  <j>(t9y)  =  t3y  +  4/2y2  +  4y3  +  k(t).  Differentiating 
this  expression  with  respect  to  t  and  using  (i)  we  obtain  that 

3  t2y  +  Sty2  +  k\t)  =  3t2y  +  Sty2. 

Thus  k(t)  =  0  and  <j>(t9y)=  t3y  -\-4t2y2-\-4y3. 

Third  Method:  From  (i)  and  (ii) 

<t>(t9y)  =  t3y  +  4*2y2  +  h(y)  and  <j>(t9y)  =  t3y  +  4t2y2  +  4y3  +  k(t). 

Comparing  these  two  expressions  for  the  same  function  <j>(t9y)  we  see  that 
h(y)  =  4y3  and  k{t)  —  0.  Hence,  <$>(t9y)=t3y  +  4t2y2  +  4y3. 

In  most  instances,  as  Examples  3  and  4  illustrate,  the  third  method  is 
the  simplest  to  use.  However,  if  it  is  much  easier  to  integrate  N  with  re- 
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spect  to  y  than  it  is  to  integrate  M  with  respect  to  t ,  we  should  use  the  sec¬ 
ond  method,  and  vice-versa. 


Example  5.  Find  the  solution  of  the  initial-value  problem 

4t3e,+y  +  t4e‘+y  +  2t  +  (t4e‘+y  +  2y)^-  =0,  >'(0)=1. 

Solution .  This  equation  is  exact  since 

(4  t3e,+y  +  t4e,+y  +  2t)  =  (t4  +  4t3)e'+y  =  -|y  (f  V+^  +  2 y). 


Hence,  there  exists  a  function  <j>(t, y)  such  that 


and 


(i)  4t3et+y  +  t4e‘+y  +  2t  = 


d<j> 

IF 


(ii)  t4et+y  +  2y  = 


d<j> 

9y  * 


Since  it  is  much  simpler  to  integrate  t4et+y  +  2y  with  respect  toy  than  it  is 
to  integrate  4 t3et+y  +  t4et+y  +  2t  with  respect  to  /,  we  use  the  second 
method.  From  (ii),  <f>(t,y)=  t4et+y  +y2  +  k(t).  Differentiating  this  expres¬ 
sion  with  respect  to  t  and  using  (i)  we  obtain 


(t4  +  4t3)et+y  +  k'(t)  =  4t3et+y  +  t4et+y  +  2t. 

Thus,  k{t)=t 2  and  the  general  solution  of  the  differential  equation  is 
<t>(t,y)=  t4et’¥y  +y2+  t2=  c.  Setting  f  =  0  and.y=l  in  this  equation  yields  c 
=  1 .  Thus,  the  solution  of  our  initial-value  problem  is  defined  implicitly  by 
the  equation  t4et+y  +  t2+y2=  1. 


Suppose  now  that  we  are  given  a  differential  equation 

M(t,y)  +  N(t,y)^=0 


which  is  not  exact.  Can  we  make  it  exact?  More  precisely,  can  we  find  a 
function  \i{t,y)  such  that  the  equivalent  differential  equation 

n(t,y)M(t,y)  +  n(t,y)N  ( t,y )  ^  =  0  (8) 

is  exact?  This  question  is  simple,  in  principle,  to  answer.  The  condition  that 
(8)  be  exact  is  that 


or 


( t,y ))  =  Jy  ( n(t,y)N(t,y)) 


(For  simplicity  of  writing,  we  have  suppressed  the  dependence  of  /x,M  and 
N  on  t  and  y  in  (9).)  Thus,  Equation  (8)  is  exact  if  and  only  if  fi(t,y)  satis¬ 
fies  Equation  (9). 
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Definition.  A  function  \i(t,y)  satisfying  Equation  (9)  is  called  an  integrat¬ 
ing  factor  for  the  differential  equation  (7). 


The  reason  for  this  definition,  of  course,  is  that  if  ft  satisfies  (9)  then  we 
can  write  (8)  in  the  form  ( d/dt)<j>(t,y)  =  Q  and  this  equation  can  be  in¬ 
tegrated  immediately  to  yield  the  solution  <f>(t,y)=c.  Unfortunately, 
though,  there  are  only  two  special  cases  where  we  can  find  an  explicit  solu¬ 
tion  of  (9).  These  occur  when  the  differential  equation  (7)  has  an  integrat¬ 
ing  factor  which  is  either  a  function  of  t  alone,  or  a  function  of  y  alone. 
Observe  that  if  fx  is  a  function  of  /  alone,  then  Equation  (9)  reduces  to 


N 


d[i 

dt 


9  M 
dy 


9  TV 
9/ 


9  M  M\ 
9 y  9 1  ) 


But  this  equation  is  meaningless  unless  the  expression 

9  M  9  N 
9 y  9 1 

N 

is  a  function  of  t  alone,  that  is, 


9  M  9  N 


9/ 


N 


-*(0- 


If  this  is  the  case  then  ju(7)  =  exp 
differential  equation  (7). 


is  an  integrating  factor  for  the 


Remark.  It  should  be  noted  that  the  expression 

9M  9  N 
9 y  dt 

N 

is  almost  always  a  function  of  both  t  andy.  Only  for  very  special  pairs  of 
functions  M(t,y)  and  N(t,y )  is  it  a  function  of  t  alone.  A  similar  situatioh 
occurs  if  /x  is  a  function  of  y  alone  (see  Exercise  17).  It  is  for  this  reason 
that  we  cannot  solve  very  many  differential  equations. 

Example  6.  Find  the  general  solution  of  the  differential  equation 

-y  +2ye'  +  (>’  +  e')^=0. 

Solution.  Here  M (t.y)  =  ( v2/2)  +  2ye‘  and  N(t,y)=y  +  e‘.  This  equation  is 
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not  exact  since  dM  /dy  —y  +  2el  and  dN  /  dt  =  e( .  However, 


1[3  M  dN\  >’  +  <?'  . 
N\dy  dt  }  y  +  e‘  ' 


Hence,  this  equation  has  n(t)  =  exp(j  1  dt\  =  e’  as  an  integrating  factor. 
This  means,  of  course,  that  the  equivalent  differential  equation 

~2~el  +  2ye2t  +  (yel  +  e2')- =  0 
is  exact.  Therefore,  there  exists  a  function  <f>(t,y)  such  that 

■2  d<t> 


y 


(i)  —e‘  +  2ye 


It  _ 


dt 


and 


(ii)  ye1  +  e2t  = 


d  <j> 


From  Equations  (i)  and  (ii), 


y 


<f>(t’y)=  +ye2‘ + h(y) 


and 


<t>(t,y)=  ^-et+ye2t  +  k(t). 

Thus,  h(y)=s  0,  k(t)  =  0  and  the  general  solution  of  the  differential  equation 
is 

<t>(t,y)=^-e‘+ye2‘  =  c. 

Solving  this  equation  for  y  as  a  function  of  t  we  see  that 

y(t)=  —  el ±  [e2t  +  2 ce~t]1^2. 


Example  7.  Use  the  methods  of  this  section  to  find  the  general  solution  of 
the  linear  equation  ( dy /  dt)  -I-  a(t)y  =  b(t). 

Solution.  We  write  this  equation  in  the  form  M(t,y)  +  N(t,y)(dy/dt)  =  0 
with  M(t,y)  =  a(t)y  —  b(t)  and  N(t,y)=  1.  This  equation  is  not  exact  since 
dM/dy  =  a{t)  and  dN/dt  =  0.  However,  ((dM / dy)- (dN/dt))/ N  =  a(t). 

Hence,  ju(t)  =  exp^J a(t)dtj  is  an  integrating  factor  for  the  first-order  lin¬ 
ear  equation.  Therefore,  there  exists  a  function  <l>(t,y)  such  that 


(i)  ix(t)[a(t)y-b(t)]  = 


d<j> 

~dt 
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and 

0<f> 

(ii) 

Now,  observe  from  (ii)  that  <^{t,y)  —  n(t)y  +  k(t).  Differentiating  this  equa¬ 
tion  with  respect  to  t  and  using  (i)  we  see  that 

ix'(t)y  +  k'(t)  =  n(t)a(t)y  -  n(t)b(t). 

But,  jLt'(r)  =  a(t)  n(t).  Consequently,  k'(t)=  -  \i(t)b(t)  and 

<t>(t,y)  =  n(t)y-  f  ft {t)b(t)dt. 

Hence,  the  general  solution  of  the  first-order  linear  equation  is 

fi(t)y  -  f  n(t)b{t)  dt  =  c, 
and  this  is  the  result  we  obtained  in  Section  1.2. 


Exercises 


1.  Use  the  theorem  of  equality  of  mixed  partial  derivatives  to  show  that  dM/dy 
=  3 N/dt  if  the  equation  M  (t,y)  +  N  (t,y)(dy / dt)  =  0  is  exact. 


2.  Show  that  the  expression  M(t,y)—  J (dN(t,y)/dt)dy  is  a  function  of  t  alone  if 
dM/dy  =  dN/dt. 


In  each  of  Problems  3-6  find  the  general  solution  of  the  given  differential 
equation. 

3.  Itsiny +y3 4 5 6 7 8 9 10ef +  (t2cosy +  3y2et)^  =0 

4.  1  +(1  +  ty)e,y  +  {\  +  t2ely)~f7  =0 


5.  ysec2r  +  sec/tan/  +  (2y  +  tan0^- =0 

6.  ^--2ye‘  +  (y-e‘)^=0 


In  each  of  Problems  7-11,  solve  the  given  initial-value  problem. 

7.  2ty3  +  3t2y2^=0,  y(l)=l 

8.  Itcosy  +  3t2y  +  (t3  —  f2siny  —  y)^  =  0,  y( 0)  =  2 

9.  3l2  +  4iy  +  (2y  +  2t2)^-  =0,  y(0)=l 

dv 

10.  >»(cos2/)e0’  — 2(sin2/)e<>’  +  2f +(f(cos2/)e0’  — 3)-^-  =0,  _y(0)=0 
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11.  3ty+y1  +  (t1+ty)^-=0,  y( 2)=1 


dy_ 

dt 


In  each  of  Problems  12-14,  determine  the  constant  a  so  that  the  equation 
is  exact,  and  then  solve  the  resulting  equation. 

12.  t  +ye2ty  +  atelty  =  0 

i  i  (at  + 1)  dy 

13.  4  +  -4  +  - - i — ~  ~r  —  0 


V 


r 


r 


dt 


dy 


14.  eat+*  +  3t2y2  +  (2yt3  +  eat+y)-^=0 

15.  Show  that  every  separable  equation  of  the  form  M(t)  +  N ( y)dy /dt =0  is  exact. 

16.  Find  all  functions  f(t)  such  that  the  differential  equation 

y2sin<  +yf(t)(dy/dt) =0 

is  exact.  Solve  the  differential  equation  for  these  f(t). 

17.  Show  that  if  (( dN/dt)  —  (dM/dy))/M=Q(y ),  then  the  differential  equation 
M(t,y)+  N(t,y)<fy/dJ—0  has  an  integrating  factor /»(>■) 

18.  The  differential  equation  f(t)(dy/dt)  +  t2+y  =0  is  known  to  have  an  integrat¬ 
ing  factor  n(t)=t.  Find  all  possible  functions /(/). 

19.  The  differential  equation  e'secy  —  tany  +  (dy / dt)  =  0  has  an  integrating  factor 
of  the  form  e~at cosy  for  some  constant  a.  Find  a ,  and  then  solve  the  differen¬ 
tial  equation. 

20.  The  Bernoulli  differential  equation  is  (dy / dt)  + a(t)y  =  b(t)yn.  Multiplying 

through  b,  ,(,).„p{/.(,)d),  we  can  rewrite  th,s  equation  in  the  (om 

d/dt(ii(t)y)  =  b(t)  fx(t)yn.  Find  the  general  solution  of  this  equation  by  find¬ 
ing  an  appropriate  integrating  factor.  Hint:  Divide  both  sides  of  the  equation 
by  an  appropriate  function  of  y. 


1.10  The  existence-uniqueness  theorem;  Picard  iteration 

Consider  the  initial-value  problem 

-jt y(t0)=yo  (!) 

where/ is  a  given  function  of  t  and>\  Chances  are,  as  the  remarks  in  Sec¬ 
tion  1.9  indicate,  that  we  will  be  unable  to  solve  (1)  explicitly.  This  leads  us 
to  ask  the  following  questions. 

1.  How  are  we  to  know  that  the  initial- value  problem  (1)  actually  has  a 
solution  if  we  can’t  exhibit  it? 
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2.  How  do  we  know  that  there  is  only  one  solution  y(t)  of  (1)?  Perhaps 
there  are  two,  three,  or  even  infinitely  many  solutions. 

3.  Why  bother  asking  the  first  two  questions?  After  all,  what’s  the  use  of 
determining  whether  (1)  has  a  unique  solution  if  we  won’t  be  able  to  ex¬ 
plicitly  exhibit  it? 


The  answer  to  the  third  question  lies  in  the  observation  that  it  is  never 
necessary,  in  applications,  to  find  the  solution  y  (t)  of  (1)  to  more  than  a 
finite  number  of  decimal  places.  Usually,  it  is  more  than  sufficient  to  find 
y(t)  to  four  decimal  places.  As  we  shall  see  in  Sections  1.13-17,  this  can  be 
done  quite  easily  with  the  aid  of  a  digital  computer.  In  fact,  we  will  be  able 
to  compute  y(t)  to  eight,  and  even  sixteen,  decimal  places.  Thus,  the 
knowledge  that  (1)  has  a  unique  solution  y(t)  is  our  hunting  license  to  go 
looking  for  it. 

To  resolve  the  first  question,  we  must  establish  the  existence  of  a  func¬ 
tion  y(t)  whose  value  at  t  =  t0  is  y0f  and  whose  derivative  at  any  time  t 
equals  f(t,y(t)).  In  order  to  accomplish  this,  we  must  find  a  theorem  which 
enables  us  to  establish  the  existence  of  a  function  having  certain  proper¬ 
ties,  without  our  having  to  exhibit  this  function  explicitly.  If  we  search 
through  the  Calculus,  we  find  that  we  encounter  such  a  situation  exactly 
once,  and  this  is  in  connection  with  the  theory  of  limits.  As  we  show  in  Ap¬ 
pendix  B,  it  is  often  possible  to  prove  that  a  sequence  of  functions yn(t)  has 
a  limity(0>  without  our  having  to  exhibit >>(7).  For  example,  we  can  prove 
that  the  sequence  of  functions 


z  v  sin7rt  ,  sin277/  ,  ,  sin  mt 

yn{t)= — —  + — —  +  ...+ 


1 


22 


tl‘ 


has  a  limit  y(t)  even  though  we  cannot  exhibit  y(t)  explicitly.  This  suggests 
the  following  algorithm  for  proving  the  existence  of  a  solution  y (t)  of  (1). 

(a)  Construct  a  sequence  of  functions  yn(t)  which  come  closer  and  closer 
to  solving  (1). 

(b)  Show  that  the  sequence  of  functions  yn(t)  has  a  limit y(t)  on  a  suitable 
interval  t0  <  t  <  /0+  a. 

(c)  Prove  that  y(t)  is  a  solution  of  (1)  on  this  interval. 

We  now  show  how  to  implement  this  algorithm. 


(a)  Construction  of  the  approximating  sequence  yn(t) 

The  problem  of  finding  a  sequence  of  functions  that  come  closer  and 
closer  to  satisfying  a  certain  equation  is  one  that  arises  quite  often  in 
mathematics.  Experience  has  shown  that  it  is  often  easiest  to  resolve  this 
problem  when  our  equation  can  be  written  in  the  special  form 


y{t)  =  L(t,y(t)),  (2) 


where  L  may  depend  explicitly  on  y,  and  on  integrals  of  functions  of  y. 
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For  example,  we  may  wish  to  find  a  function  y(t)  satisfying 

,v(0  =  l  +  sin[f +>>(/)], 
or 


y(t)  =  l+y2(t)  +  j^y3(s)ds. 


In  these  two  cases,  L(t,y(t))  is  an  abbreviation  for 

1  +  sin[/+y(7)] 


and 


\+y2(t)  +  f  y3(s)ds , 

•/o 

respectively. 

The  key  to  understanding  what  is  special  about  Equation  (2)  is  to  view 
L(t,y(t))  as  a  “machine”  that  takes  in  one  function  and  gives  back  another 
one.  For  example,  let 

•L(,>>'(0)  =  1+>'2(0+  f  y\s)ds. 

Jo 

If  we  plug  the  function  y(t)=  t  into  this  machine,  (that  is,  if  we  compute 

1  +  /2  +  fts3ds)  then  the  machine  returns  to  us  the  function  1  -I-  /2+  Z4/ 4.  If 
Jq 

we  plug  the  function y(t)  =  cost  into  this  machine,  then  it  returns  to  us  the 
function 


1  +  cos2 /  +  /  cos3sds  =  1  +  cos2/  +  sin/ - — . 

J  o  3 

According  to  this  viewpoint,  we  can  characterize  all  solutions  y(t)  of  (2)  as 
those  functions  y(t)  which  the  machine  L  leaves  unchanged.  In  other 
words,  if  we  plug  a  function  y(t)  into  the  machine  L,  and  the  machine  re¬ 
turns  to  us  this  same  function,  theny(f)  is  a  solution  of  (2). 

We  can  put  the  initial-value  problem  (1)  into  the  special  form  (2)  by  in¬ 
tegrating  both  sides  of  the  differential  equation  y'=  f(t,y)  with  respect  to  t. 
Specifically,  ify(0  satisfies  (1),  then 


so  that 


y{t)=y<>+  f  /(s>y(s))ds.  (3) 

J‘o 

Conversely,  if  y{t)  is  continuous  and  satisfies  (3),  then  dy / dt—f{t,y{t)). 
Moreover,  y(t0)  is  obviously  y0.  Therefore,  y(t)  is  a  solution  of  (1)  if,  and 
only  if,  it  is  a  continuous  solution  of  (3). 
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Equation  (3)  is  called  an  integral  equation,  and  it  is  in  the  special  form 
(2)  if  we  set 


jL(*O'(0)=>'o+ 


This  suggests  the  following  scheme  for  constructing  a  sequence  of  “ap¬ 
proximate  solutions ” yn(t)  of  (3).  Let  us  start  by  guessing  a  solution y0(t)  of 
(3).  The  simplest  possible  guess  is  To  check  whether  y0(t)  is  a 

solution  of  (3),  we  compute 

>'i(0=>;o+  f  f(s,y0(s))ds. 

If  JFi(0  then y(t)=y0  is  indeed  a  solution  of  (3).  If  not,  then  we  try 
yx(t)  as  our  next  guess.  To  check  whethery^/)  is  a  solution  of  (3),  we  com¬ 
pute 


yi(t)=yo+  f'  f(s,yi(s))ds , 


and  so  on.  In  this  manner,  we  define  a  sequence  of  functions  y  i(0. 
y2(t),...,  where 

yn+i(t)=yo+  f  f{s,y„(s))ds.  (4) 

J‘o 

These  functions  yn(t)  are  called  successive  approximations,  or  Picard 
iterates,  after  the  French  mathematician  Picard  who  first  discovered  them. 
Remarkably,  these  Picard  iterates  always  converge,  on  a  suitable  interval, 
to  a  solution  y(i)  of  (3). 

Example  1.  Compute  the  Picard  iterates  for  the  initial-value  problem 

y'=y ,  y(  o)  =  i, 

and  show  that  they  converge  to  the  solution y(t)  =  el. 

Solution.  The  integral  equation  corresponding  to  this  initial- value  problem 
is 


y(t)=\  +  j  y(s)ds. 

Hence,  y0{t)—  1 

yl(t)=l+  f  lds=l  +  t 
Jo 

y2(t)=l  +  yl(s)ds=  1+  f  (l  +  s)ds=  l  +  (+ 
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and,  in  general, 


^)"1  +  /o^-l(j)A"1+  f 

0  Jo 


,n-  1 


1  +  S  +  ...  + 


(«"!)! 


ds 


t2  tn 

=  1  4-  /  +  —  +  . . .  H - -  . 

21  n\ 

Since  e*  =  1 4-  /  + 12/ 2!  +  . . . ,  we  see  that  the  Picard  iterates  yn(t)  converge 
to  the  solution  y(t)  of  this  initial-value  problem. 


Example  2.  Compute  the  Picard  iterates  y\(t),y2(t)  for  the  initial-value 
problem  y'  =  1  +y3,  y(l)  =  1. 

Solution.  The  integral  equation  corresponding  to  this  initial- value  problem 
is 

y(f)=l  +  [  i+.y 3 (•*)]<&• 

Hence,  y0(t )  =  1 

^,(0-1  +  f‘(l  +  1)^=1  +  2(#-l) 

and 

-V2(0=l+  f'{l  +  [l+2(j-l)]3}(* 

=  l  +  2(r-  l)  +  3(/  -  l)2+4(/— 1)3  +  2(/—  l)4. 

Notice  that  it  is  already  quite  cumbersome  to  compute  y3(t). 


(b)  Convergence  of  the  Picard  iterates 

As  was  mentioned  in  Section  1.4,  the  solutions  of  nonlinear  differential 
equations  may  not  exist  for  all  time  t.  Therefore,  we  cannot  expect  the 
Picard  iterates yn(t)  of  (3)  to  converge  for  all  t.  To  provide  us  with  a  clue, 
or  estimate,  of  where  the  Picard  iterates  converge,  we  try  to  find  an  inter¬ 
val  in  which  all  th e  yn(t)  are  uniformly  bounded  (that  is,  \yn(t)\<K  for 
some  fixed  constant  K).  Equivalently,  we  seek  a  rectangle  R  which  con¬ 
tains  the  graphs  of  all  the  Picard  iterates yn(t).  Lemma  1  shows  us  how  to 
find  such  a  rectangle. 


Lemma  1.  Choose  any  two  positive  numbers  a  and  b ,  and  let  R  be  the  rectan¬ 
gle :  /0<  t<  t0+  a,  \y  ~y0\  <  b.  Compute 


M  =  max  \f(t,y)\,  and  set  a  =  min (tf,  tt). 
(t,y)  in  R]  K  71  \  MJ 


Then , 


(5) 


for  t0<  t  <  t0+a. 
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Lemma  1  states  that  the  graph  ol  yn(t)  is  sandwiched  between  the  lines 
y  =>>0  +  M  (t  —  /0)  and  y—y0-M{t  —  /0)>  for  t0  <  t  <  t0  4-  a.  These  lines 
leave  the  rectangle  R  at  t  =  t0  +  a  if  a<  b/ M,  and  at  t  =  tQ+b/ M  if  b/ M 
<a  (see  Figures  1  a  and  1  b).  In  either  case,  therefore,  the  graph  of  yn(t)  is 
contained  in  R  for  t0  <  t  <  /0+  a. 


Figure  1.  (a)  a  =  a ;  (b)  a  =  b/M 

Proof  of  lemma  1.  We  establish  (5)  by  induction  on  n.  Observe  first  that 
(5)  is  obviously  true  for  n  =  0,  since  Next,  we  must  show  that  (5) 

is  true  for  n  =j  +  1  if  it  is  true  for  n  =  j.  But  this  follows  immediately,  for  if 
b,(')-.Vol  <  M{t-t0),  then 

|j^+i(0— ^ol  =  f 'f(s,yj(s))ds 

J‘o 

<  f  lf(s,y,(s))\di<M(t-t0) 

J‘o 

for  t0  <  t  <  /0+  a.  Consequently,  (5)  is  true  for  all  «,  by  induction.  □ 

We  now  show  that  the  Picard  iterates yn(t)  of  (3)  converge  for  each  /  in 
the  interval  t0  <  t  <  t0+  a,  if  df/dy  exists  and  is  continuous.  Our  first  step  is 
to  reduce  the  problem  of  showing  that  the  sequence  of  functions yn(t)  con¬ 
verges  to  the  much  simpler  problem  of  proving  that  an  infinite  series  con¬ 
verges.  This  is  accomplished  by  writing yn(t)  in  the  form 

>'»(0,“^o(0+[^i(0->’o(03+ ••■  +  [>’i.(o-^-i(o]- 

Clearly,  the  sequence y„(t)  converges  if,  and  only  if,  the  infinite  series 

[MO-J'oW]  +  [>’2(0_>,i(0]  +  •••  +  [yn(t)~yn-i(t)]  +  •••  (6) 

converges.  To  prove  that  the  infinite  series  (6)  converges,  it  suffices  to 
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show  that 


00 


2  b»(0“>'»-i(OI<00- 

n=  1 

This  is  accomplished  in  the  following  manner.  Observe  that 


(7) 


\yn{t)-yn- i(OI  = 


< 


(s))  ~f(s,yn-2(s))  ]  ds 

Jto 


‘\  9/M*)) 

dy 


1 0 


where  £(5)  lies  between  7Al_1(^)  and  yn_ 2(s).  (Recall  that  f(xl)-f(x2)  = 
f'(£)(x\~  xi)>  where  £  is  some  number  between  xx  and  x2.)  It  follows  im¬ 
mediately  from  Lemma  1  that  the  points  (s,£(s))  all  lie  in  the  rectangle  R 
for  s  <  t0  +  a.  Consequently, 

\y„(t)-y„_l{t)\<L[  b#,_i(j)-yJ,_2(j)|*.  t0<t<t0  +  a,  (8) 

J<o 


where 


L  —  max 
( t,y )  in  R 


3 f(t,y) 
37 


(9) 


Equation  (9)  defines  the  constant  L.  Setting  n  =  2  in  (8)  gives 

|72(0-7i(0I  <  L  f  |71('S)-7ol^<  L  f  M  (s~  t0)ds 

LM  (t  —  t0)2 


This,  in  turn,  implies  that 


173(0-72(01  <lJ  \y2{s)-yl{s)\ds<ML1  f 

,o  4 


'  (s  -  t0y 


ds 


_  ML2(t-t0f 

=  3! 

Proceeding  inductively,  we  see  that 


n 


mlh~  1  (t—t  y 

l7„(0-7„-i(0l< - n - — ,  for  t0<  t  <  t0+  a. 


n\ 


(10) 
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Therefore,  for  t0  <  t  <  /0+  a, 
bi(0->'o(0l  +  b2(0->,i(0l  +  ••• 


ML(t-t0y  ML2(t-t0y 
<  M  (t  —  t0)  + - — - h 


2! 


^  „  ,  MLa 2  ,  ML2a 3  ^ 

<  Ma  H - — - 1 - — - 1-  . . . 


3! 


+ 


2! 


3! 


L 


r  ,  («^)2  ,  («L)3 
aL  H - - - 1 - — - 1-  . . . 


2! 


3! 


This  quantity,  obviously,  is  less  than  infinity.  Consequently,  the  Picard 
iterates  yn(t)  converge  for  each  t  in  the  interval  t0<  t<  t0+a.  (A  similar 
argument  shows  thaty„(/)  converges  for  each  t  in  the  interval  tQ  —  fi<t< 
t0 ,  where  fi  =  mm(a,b/ N),  and  N  is  the  maximum  value  of  \f(t,y)\  for 
(t,y)  in  the  rectangle  t0-  a  <  t  <  t0, \y -y0\  <  b.)  We  will  denote  the  limit  of 
the  sequence yn(t)  by y(t).  □ 


(c)  Proof  that  y(t)  satisfies  the  initial-value  problem  (1) 

We  will  show  thaty(/)  satisfies  the  integral  equation 

y(t)=y<>+  f  f(s>y(s))<&  (ii) 

J'o 

and  that  y(t)  is  continuous.  To  this  end,  recall  that  the  Picard  iterates  yjj) 
are  defined  recursively  through  the  equation 

yn+\{t)=yo+  f  f(s,y„(s))ds.  (12) 

J‘o 

Taking  limits  of  both  sides  of  (12)  gives 

y{t)=y0+  lim  f/(s,y„(s)) ds.  (13) 

*0 

To  show  that  the  right-hand  side  of  (13)  equals 

y0+  f  f(s,y(s))ds, 

J‘o 

(that  is,  to  justify  passing  the  limit  through  the  integral  sign)  we  must  show 
that 

f  f(s,y(s))ds-  f  f(s,yn(s))ds 

J‘o  J‘o 

approaches  zero  as  n  approaches  infinity.  This  is  accomplished  in  the 
following  manner.  Observe  first  that  the  graph  of  y(t)  lies  in  the  rectangle 
R  for  t  <  t0+  a,  since  it  is  the  limit  of  functions yn(t)  whose  graphs  lie  in  R. 
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Hence 


f‘f(s,y(s))ds-  f  f(s,y„(s))ds 

J'o  J‘o 


<f]f(s,y(s))-f(s,y„(s))\ds<Lf'  |y(^) 
where  L  is  defined  by  Equation  (9).  Next,  observe  that 


j  =  n  +  1 


since 


00 


.y(5)=.yo+  2 

j- i 


and 


n 


2 \yj(s)-yj-\ (*)]• 


Consequently,  from  (10), 


j  =  n+\ 
oo 


< 


M  2 


J 


U~  laJ  M 


00 


j=n+  1 


J 


r! 


2 

j  =  n  +  1 


(«Ly 


L  .  j\ 


(14) 


and 


^0 


(aL)J 


f‘f(s,y(s))ds-  f  f(s,y„(s))ds  <  M  2  -—JT~  f 

J'-  J,o  j-n+l  J '  '« 

M  V  (aL)J 

<  A/a  ^  ^ —  . 


j**n+  1 


7 


This  summation  approaches  zero  as  n  approaches  infinity,  since  it  is  the 
tail  end  of  the  convergent  Taylor  series  expansion  of  eaL.  Hence, 

lim  f  f(s,y„(s))ds=  f  f(s,y(s))ds, 

"_*00  *0  Jt0 

andy(0  satisfies  (11). 

To  show  that  y(0  is  continuous,  we  must  show  that  for  every  e>0  we 
can  find  S  >  0  such  that 

\y(t  +  h)-y(t)\<e  if  |A|<5. 

Now,  we  cannot  compare  y (t  +  h)  with  y(t)  directly,  since  we  do  not  know 
y{t)  explicitly.  To  overcome  this  difficulty,  we  choose  a  large  integer  N  and 
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observe  that 

y(t  +  h)-y(t)=  [y(t  +  h)-yN(t  +  h)] 

+  [yN(t+h)-yN(t)]  +  [.MO -.HO]- 

Specifically,  we  choose  N  so  large  that 


Then,  from  (14), 


£  2 

j~  N  +  \ 


\y(t  +  h)-yN(t  +  h)\<j-  and  |M0  “.HOI <| > 


for  t  <  /0+  a,  and  h  sufficiently  small  (so  that  t  +  h  <  /0+  a.)  Next,  observe 
th&t  yN(t)  is  continuous,  since  it  is  obtained  from  N  repeated  integrations 
of  continuous  functions.  Therefore,  we  can  choose  S  >0  so  small  that 


\yN(t  +  h)-yN(t)\<^  for  \h\ < S. 

Consequently, 

\y +  h) - y (t)\  <  \y(t  +  h)-yN(t  +  h)\  +  \yN(t  +  h)-yN(t)\ 

+  I.M0  “7(01  <J  +  f  +  f  =e 

for  \h\ <  S.  Therefore,  y(t)  is  a  continuous  solution  of  the  integral  equation 
(1 1),  and  this  completes  our  proof  thaty(/)  satisfies  (1).  □ 


In  summary,  we  have  proven  the  following  theorem. 


Theorem  2.  Let  f  and  3// 3y  be  continuous  in  the  rectangle  R :  t0<  t  <  t0  +  a, 
\y  “Tol  ^  Compute 

M=  max  \f(t,y)\,  and  set  a  =  min(a,  ). 

(O’)  in*  V  M) 

Then ,  the  initial-value  problem  y'  =f(t,y),  y(t0)=y0  has  at  least  one  solu¬ 
tion  y(t)  on  the  interval  t0  <  t<  t0+  a.  A  similar  result  is  true  for  t  <  t0. 

Remark.  The  number  a  in  Theorem  2  depends  specifically  on  our  choice 
of  a  and  b.  Different  choices  of  a  and  b  lead  to  different  values  of  a. 
Moreover,  a  doesn’t  necessarily  increase  when  a  and  b  increase,  since  an 
increase  in  a  or  b  will  generally  result  in  an  increase  in  M. 


Finally,  we  turn  our  attention  to  the  problem  of  uniqueness  of  solutions 
of  (1).  Consider  the  initial- value  problem 

^=(sin2  0//3,  7(0)  =  0.  (15) 

One  solution  of  (15)  is.y(t)  =  0.  Additional  solutions  can  be  obtained  if  we 
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ignore  the  fact  thaty(0)  =  0  and  rewrite  the  differential  equation  in  the 


form 

1  dy 

d,  ■s,n2'- 

or 

d  3>-2/J  .  ,, 

,  ^  =sin2/. 

dt  2 

Then, 

3 y2/3  1  —  cos  It 

2  2 

and y  =  ±  Vo/27  sin3  /  are  two  additional  solutions  of  (15). 

Now,  initial-value  problems  that  have  more  than  one  solution  are 
clearly  unacceptable  in  applications.  Therefore,  it  is  important  for  us  to 
find  out  exactly  what  is  “wrong”  with  the  initial-value  problem  (15)  that  it 
has  more  than  one  solution.  If  we  look  carefully  at  the  right-hand  side  of 
this  differential  equation,  we  see  that  it  does  not  have  a  partial  derivative 
with  respect  toy  aty  =  0.  This  is  indeed  the  problem,  as  the  following  theo¬ 
rem  shows. 

Theorem  2'.  Let  f  and  df/dy  be  continuous  in  the  rectangle  R  '-t0<  t  <  t0  +  a, 
|y-y0|<  6.  Compute 

M  =  max  |/(/,y)|,  and  set  a  =  mini  tf,  ttV 

(t,y)  in  RUX  V  MJ 

Then ,  the  initial- value  problem 

y'=f{uy),  y(.t0)=y0  O6) 

has  a  unique  solution  y(t)  on  the  interval  t0  <  t  <  /0+  a.  In  other  words ,  if 
y{t)  and  z(t)  are  two  solutions  of  ( 16),  then  y(t)  must  equal  z(t)  for  t0<t 

K  /q  +  OL. 

Proof.  Theorem  2  guarantees  the  existence  of  at  least  one  solution  y(t)  of 
(16).  Suppose  that  z(t)  is  a  second  solution  of  (16).  Then, 

y(t)=y<>+  f  f(s,y(s))ds  and  z(t)=y0+  f  f(s,z(s))ds. 

J<o 

Subtracting  these  two  equations  gives 


b(0_z(0l" 


f  [f(s,y(s))-f(s,z(s))]ds 


<f  \f(s,y(s))-f(s,z(s))lds 
Jto 

<  Lf'  |7(j)-z(s)|<fe 


77 


1  First-order  differential  equations 


where  L  is  the  maximum  value  of  |3//9y|  for  (t,y)  in  R.  As  Lemma  2  be¬ 
low  shows,  this  inequality  implies  that  y(t)  =  z(t).  Hence,  the  initial-value 
problem  (16)  has  a  unique  solutiony(f).  □ 

Lemma  2.  Let  w(t)  be  a  nonnegative  function,  with 

w(t)<Lf  w(s)ds.  (17) 

Then ,  w(t)  is  identically  zero. 

Fake  proof.  Differentiating  both  sides  of  (17)  gives 

or  ^-Lw(0<0. 

Multiplying  both  sides  of  this  inequality  by  the  integrating  factor 
gives 

J^-e“L^_'°V(f)  <  0,  so  that  e~L(<t~to^w{t)  <  w(f0) 

for  t>  t0.  But  w(f0)  must  be  zero  if  w(t)  is  nonnegative  and  satisfies  (17). 
Consequently,  e~L(t~to)w(t)  <  0,  and  this  implies  that  w(t)  is  identically 
zero. 

The  error  in  this  proof,  of  course,  is  that  we  cannot  differentiate  both 
sides  of  an  inequality,  and  still  expect  to  preserve  the  inequality.  For  exam¬ 
ple,  the  function  fx(t)  =  2t  —  2  is  less  than  f2(t)=t  on  the  interval  [0, 1],  but 
f{(t)  is  greater  than  f^t)  on  this  interval.  We  make  this  proof  “kosher”  by 
the  clever  trick  of  setting 


U(t)=  f  w(s)ds. 

Then, 

=  w(t)  <L  f  w(s)ds=  LU (t). 
at  JtQ 

Consequently,  e~L(t~to) U(t)<  U(t0)  =  0,  for  t  >  t0,  and  thus  U(t)  =  0.  This, 
in  turn,  implies  that  w(/)  =  0  since 

0 <w(t)<Lf  w(s)ds=  LU (t)  —  0.  □ 


Example  3.  Show  that  the  solution  y(t)  of  the  initial- value  problem 

,(0)-0 

exists  for  0  <  t  <  and  in  this  interval,  <  1. 
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Solution .  Let  R  be  the  rectangle  0<  /  <  |y|  <  1.  Computing 

M—  max  t2  +  e~y2  =  l  +  (^)2  =  j, 

(t,y)  in  R  V  7 

we  see  thaty(/)  exists  for 

and  in  this  interval,  \y(t)\  <  1. 

Example  4.  Show  that  the  solution  y(t)  of  the  initial- value  problem 

^  =  e~‘1+y\  >'(°)  =  1 

exists  for  0  <  t  <  1  /9,  and  in  this  interval,  0  <  y  <  2. 

Solution.  Let  R  be  the  rectangle  0  <  /  <  £,  0<  y  < 2.  Computing 

M=  max  e~'2+y3=  1  +  23  =  9, 

( t,y )  in  R 

we  see  that  y  (t)  exists  for 

0<  /<min(|,|) 
and  in  this  interval,  0  <  y  <  2. 

Example  5.  What  is  the  largest  interval  of  existence  that  Theorem  2  pre¬ 
dicts  for  the  solution  y  (7)  of  the  initial-value  problem  y'  =  1  +y2,  y(0)  =  0? 
Solution.  Let  R  be  the  rectangle  0  <  t<a,  Computing 

M=  max  1  +y2=  1  +  b1, 

(t,y)  in  R 

we  see  that  >>(7)  exists  for 

0  <  t  <  a  =  mini  a ,  — |. 

V  l  +  b2) 

Clearly,  the  largest  a  that  we  can  achieve  is  the  maximum  value  of  the 
function  b/(l  +  b2).  This  maximum  value  is  Hence,  Theorem  2  predicts 

that  y  (t)  exists  for  0  <  t  <  \.  The  fact  that  y  (/)  =  tan  t  exists  for  0  <  t  <  tt/2 
points  out  the  limitation  of  Theorem  2. 

Example  6.  Suppose  that  \f(t9y)\  <  K  in  the  strip  t0  <  t  <  oo,  —  oo  <y  <  oo. 
Show  that  the  solution  y(t)  of  the  initial-value  problem  y'  =f(t,y),  y(t0)  = 
y0  exists  for  all  t  >  t0. 

Solution.  Let  R  be  the  rectangle  tQ  <  t  <  f0  +  a,  | y  —y0\  <  b.  The  quantity 

M=  max  l/(o>)l 

(t,y)  in  R 
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is  at  most  AT.  Hence,  y{t)  exists  for 

t0<t<  t0+min(a,b/ K). 

Now,  we  can  make  the  quantity  min(a,b /  K)  as  large  as  desired  by  choos¬ 
ing  a  and  b  sufficiently  large.  Therefore  y(t)  exists  for  t  >  t0. 

Exercises 

1.  Construct  the  Picard  iterates  for  the  initial-value  problem  y'  =  2t(y  + 1),  y(0)  = 
0  and  show  that  they  converge  to  the  solution y(t)  =  et  —  1. 

2.  Compute  the  first  two  Picard  iterates  for  the  initial-value  problem  y'  =  /2+y2, 

y(  0)=i. 

3.  Compute  the  first  three  Picard  iterates  for  the  initial- value  problemy'  =  e'+y2, 

y(0)=o. 

In  each  of  Problems  4—15,  show  that  the  solution  y(t)  of  the  given  initial- 
value  problem  exists  on  the  specified  interval. 

4.  y' =*y2  +  cost2,  .y(0)  =  0; 

5.  y'  =  1  +y  +y2cost,  y(0)  =  0;  0</<y 

6.  y'=  t  +  y2,y( 0)  =  0;  0  <  t  <  (j)2/3 

7.  y'  =  e~‘2+y2,  y( 0)  =  0;  0 

8.  y=  e~'2  +y2,y(l)  =  0;  1  <  t  <  1  +  Ve  /2 

9.  y'  =  e~‘2+y2,  y(0)=l;  0 </< - ^ - 

1+(1  +  V2  )2 

10.  y'=y  +  e~y  +  e~‘,  y( 0)  =  0;  0</<l 

11.  y'=y3  +  e~5t,  y(0)  =  0.4;  0  <  t  < 

12 . y  =  e°~,)2,  y(0)  =  1;  0<?<  ^  e-((»  + V3)/2)2 

13.  y  =  (4y  +  e-'2)e2',  y( 0)  =  0;  0<r<— !— 

8Ve 

14.  y'  =  e~'  +  ki(l+y2),  >’(0)=0;  0</<oo 

15.  ^'  =  j(l+cos4/)^  — ^(1  — cos4/)>»2,  _y(0)=  100;  0</<l 

16.  Consider  the  initial-value  problem 

y'=t2+y2,  y  (0)  =  0,  (*) 

and  let  R  be  the  rectangle  0  <  /  <  a ,  —  b  <  y  <  b. 

(a)  Show  that  the  solution  y(t)  of  (*)  exists  for 

0<  t  <  min(  a,  - 

\  a2+b2) 
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(b)  Show  that  the  maximum  value  of  b/(a2  +  b2),  for  a  fixed,  is  1/2 a. 

(c)  Show  that  a  =  min(tf,  \a)  is  largest  when  a  =  1/ V2  . 

(d)  Conclude  that  the  solution  y{t)  of  (*)  exists  for  0  <  t  <  1/ V2  . 

17.  Prove  thaty(/)  =  —  1  is  the  only  solution  of  the  initial- value  problem 

/  =  ;(l+y),  y(  0)=-l. 

18.  Find  a  nontrivial  solution  of  the  initial- value  problem  y' —  tya,  j>(0)  =  0,  a>  1. 
Does  this  violate  Theorem  2'?  Explain. 

19.  Find  a  solution  of  the  initial- value  problem  y'  =  t  V 1  -y2  ,y(0)  =  1,  other  than 
y(t)=  1.  Does  this  violate  Theorem  2'?  Explain. 


20.  Here  is  an  alternate  proof  of  Lemma  2.  Let  w(t)  be  a  nonnegative  function 
with 

w(t)<Lf  w(.s)d!s  (*) 

•'fn 


on  the  interval  t0  <  t  <  t0+  a.  Since  w{t)  is  continuous,  we  can  find  a  constant 
A  such  that  0 <  w(t)  <  A  for  t0<  t  <  t0+  a. 

(a)  Show  that  w(t)<  LA  (t-  t0). 

(b)  Use  this  estimate  of  w(t)  in  (*)  to  obtain 


w(t)  < 


AL2(t-t0y 


(c)  Proceeding  inductively,  show  that  w(t)  <  ALn(t  —  t0)n / n\,  for  every  integer 
n. 

(d)  Conclude  that  w(/)  =  0  for  t0  <  /  <  r0+  a. 


1.11  Finding  roots  of  equations  by  iteration 

Suppose  that  we  are  interested  in  finding  the  roots  of  an  equation  having 
the  special  form 

x=f(x).  (1) 

For  example,  we  might  want  to  find  the  roots  of  the  equation 

x  =  sinx  + 

The  methods  introduced  in  the  previous  section  suggest  the  following  algo¬ 
rithm  for  solving  this  problem. 

1.  Try  an  initial  guess  x0,  and  use  this  number  to  construct  a  sequence  of 

guesses  where  -^i “/ (-^o)’  *^2=./X^'iX  -^3 (-^'2)’  and  so  on. 

2.  Show  that  this  sequence  of  iterates  xn  has  a  limit  rj  as  n  approaches  in¬ 
finity. 

3.  Show  that  r\  is  a  root  of  (1);  i.e.,  tj=/(t0. 

The  following  theorem  tells  us  when  this  algorithm  will  work. 

Theorem  3.  Let  f  (x)  and  f'(x)  be  continuous  in  the  interval  a  <  x  <  b,  with 
|/'C*)I  1  in  this  interval .  Suppose ,  moreover ,  that  the  iterates  xn,  de- 
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fined  recursively  by  the  equation 

*»+i  =/K)  (2) 

all  lie  in  the  interval  [a,b].  Then ,  the  iterates  xn  converge  to  a  unique  num¬ 
ber  rj  satisfying  (1). 

Proof.  We  convert  the  problem  of  proving  that  the  sequence  xn  converges 
to  the  simpler  problem  of  proving  that  an  infinite  series  converges  by  writ¬ 
ing  xn  in  the  form 

Clearly,  the  sequence  xn  converges  if,  and  only  if,  the  infinite  series 

00 

(X]  Xq)  +  (x2  +  ...  +  (x„  i)  +  ...  =  2 

n  =  1 

converges.  To  prove  that  this  infinite  series  converges,  it  suffices  to  show 
that 

00 

l*l“*ol  +  l*2“*ll+  •••  =  2  |-XW  —  i|  <  00. 

n  *  1 

This  is  accomplished  in  the  following  manner.  By  definition,  xn=f(xn_x) 
and  xn_x—f{xn_f).  Subtracting  these  two  equations  gives 

x„  -  X„_i  =f(x„_l)  -f(x„_2)  -/'(€)(*„_  1  -  X„_2 ), 

where  £  is  some  number  between  xrl_l  and  x„_2.  In  particular,  £  is  in  the 
interval  [ a,b ].  Therefore,  |/'(£)|  <  A,  and 

J  <  A|xc„_  ,  —  JC„_2|.  (3) 

Iterating  this  inequality  n  —  1  times  gives 

<X2k-2-*„-3l 

<Xn_1|xi-x0|- 


Consequently, 


00 

2  k 


n=\ 


<  2  A" 

n  =  \ 


xx-x0  1 +  A+A2  + 


1  -A 


This  quantity,  obviously,  is  less  than  infinity.  Therefore,  the  sequence  of 
iterates  xn  has  a  limit  rj  as  n  approaches  infinity.  Taking  limits  of  both 


82 


1.11  Finding  roots  of  equations  by  iteration 


sides  of  (2)  gives 

1  =  ]im^+1  =  nlim  f(xn)  =/(i?). 

Hence,  77  is  a  root  of  (1). 

Finally,  suppose  that  rj  is  not  unique;  that  is,  there  exist  two  solutions  tj1 
and  tj2  of  (1)  in  the  interval  [a,b\.  Then, 

Vi  -  1?2=/(17i)  -f(v2)  =f(Z)(rh  -  Vz)> 

where  £  is  some  number  between  y\x  and  tj2.  This  implies  that  =  tj2  or  /'(I) 
=  1.  But /'(£)  cannot  be  one,  since  £  is  in  the  interval  [a9b].  Therefore,  77,  = 

*)2-  □ 

Example  1.  Show  that  the  sequence  of  iterates 

jc0,  xx  =  1  +  |arc  tanx0,  x2  =  1  +  ^arc  tanx„ . . . 
converge  to  a  unique  number  77  satisfying 

rj  =  l  +  ^arctanTj 


for  every  initial  guess  x0. 

Solution .  Let  f(x)=  1  +  \  arc  tan  x.  Computing  f\x)  =  \  1/(1  H-x2),  we  see 

that  |/'(*)|  is  always  less  than  or  equal  to  Hence,  by  Theorem  3,  the 
sequence  of  iterates  x0,xvx2, ...  converges  to  the  unique  root  rj  of  the 
equation  x=  1  +  ^arctanx,  for  every  choice  of  x0. 

There  are  many  instances  where  we  know,  a  priori,  that  the  equation 
x  =  f(x)  has  a  unique  solution  77  in  a  given  interval  [a9  b ].  In  these  in¬ 
stances,  we  can  use  Theorem  3  to  obtain  a  very  good  approximation  of  77. 
Indeed,  life  is  especially  simple  in  these  instances,  since  we  don’t  have  to 
check  that  the  iterates  xn  all  lie  in  a  specified  interval.  If  x0  is  sufficiently 
close  to  77,  then  the  iterates  xn  will  always  converge  to  77,  as  we  now  show. 

Theorem  4.  Assume  that  f(j])  =  77,  and  that  \f'(x)\  <  A  <  1  in  the  interval 
\x  —  77I  <  a .  Choose  a  number  x0  in  this  interval.  Then ,  the  sequence  of 
iterates  xn,  defined  recursively  by  the  equation  xn+l  =f(xn),  will  always 
converge  to  77. 

Proof.  Denote  the  interval  \x  —  77 1  <  a  by  /.  By  Theorem  3,  it  suffices  to 
show  that  all  the  iterates  xn  lie  in  I.  To  this  end,  observe  that 

xj+ 1  -  v  =/(*,)  ~f(v)  =f'(£)(xj  -  v) 

where  £  is  some  number  between  xy-  and  ij.  In  particular,  £  is  in  /  if  x-  is  in 
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/.  Thus, 


|xy+i-rj|<X|xy-r||<|xy-T)|  (4) 

if  Xj  is  in  I.  This  implies  that  xJ+l  is  in  I  whenever  Xj  is  in  /.  By  induction, 
therefore,  all  the  iterates  xn  lie  in  /.  □ 

Equation  (4)  also  shows  that  xn+l  is  closer  to  rj  than  xn.  Specifically,  the 
error  we  make  in  approximating  rj  by  xn  decreases  by  at  least  a  factor  of  X 
each  time  we  increase  n .  Thus,  if  X  is  very  small,  then  the  convergence  of  xn 
to  7]  is  very  rapid,  while  if  X  is  close  to  one,  then  the  convergence  is  very 
slow. 

Example  2. 

(a)  Show  that  the  equation 


*  =  sin*  +  \ 


has  a  unique  root  77  in  the  interval  [77/4,77/2]. 

(b)  Show  that  the  sequence  of  numbers 

x0 ,  *1  =  sin*0+^,  x2  =  sinx1  +  ^,... 

will  converge  to  77  if  77/4  <  x0  <  77 /2. 

(c)  Write  a  computer  program  to  evaluate  the  first  N  iterates  *j,*2, ...,xN. 
Solution. 

(a)  Let  g(*)  =  *  — sin*  — and  observe  that  g(77/4)  is  negative  while 

#(77/2)  is  positive.  Moreover,  g(x)  is  a  monotonic  increasing  function  of  * 
for  77/4  <  77/2,  since  its  derivative  is  strictly  positive  in  this  interval. 

Therefore,  Equation  (5)  has  a  unique  root  x  =  rj  in  the  interval  it/A<x< 
77/2. 

(b)  Let  I  denote  the  interval  17  —  77/4 <*<77  +  77/4.  The  left  endpoint  of 
this  interval  is  greater  than  zero,  while  the  right  endpoint  is  less  than  377/4. 
Hence,  there  exists  a  number  X,  with  0<A<  1,  such  that 


cos*  = 


dx 


(sinx  +  I) 


<\ 


for  x  in  I.  Clearly,  the  interval  [ir/4,w/2]  is  contained  in  1.  Therefore,  by 
Theorem  4,  the  sequence  of  numbers 


*0,  *1  =  sin*0+^,  *2  =  8^*!  +  ^,... 


will  converge  to  rj  for  every  *0  in  the  interval  [77/4,77/2]. 


84 


1.11  Finding  roots  of  equations  by  iteration 


Pascal  Program 


Program  Iterate  (input,  output); 
var 

X:  array [0. .  199]  of  real; 
k,  N:  integer; 

begin 

readln(X[0],  N); 
page; 

writeln(‘N’:4,  ‘X[N]’:14); 
for  k  :=0  to  N  do 
begin 

writeln(K:4,  ‘  ’:4,  X[k]:17:9); 
X[k  +  1]  :=0.25  +  sin(X[k]); 
end; 

end. 


Fortran  Program 


10 

C 


20 

C 


30 

40 


DIMENSION  X(200) 

READ  (5,10)  X0,  N 
FORMAT  (FI  5.8, 15) 

COMPUTE  X(1)  FIRST 
X(1 )  =  0.25  +  SIN(XO) 

KA  =  0 
KB  =  1 

WRITE  (6, 20)  KA,  X0,  KB,  X(1 ) 

FORMAT  (1  HI  ,4X,  ‘N’,  10X,  ‘X’/(1  H,3X,  13, 4X,  FI 5.9)) 
COMPUTE  X(2)  THRU  X(N) 

DO  40  K  =  2,  N 

X(K)  =  0.25  +  SIN(X(K  - 1 )) 

WRITE  (6,30)  K,  X(K) 

FORMAT  (1  H,3X,  13, 4X,  FI 5.9) 

CONTINUE 
CALL  EXIT 
END 


See  also  C  Program  1  in  Appendix  C  for  a  sample  C  program. 
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Table  1 


n 

Xn 

n 

xn 

0 

1 

8 

1.17110411 

1 

1.09147099 

9 

1.17122962 

2 

1.13730626 

10 

1.17122964 

3 

1.15750531 

11 

1.17122965 

4 

1.16580403 

12 

1.17122965 

5 

1.16910543 

13 

1.17122965 

6 

1.17040121 

14 

1.17122965 

7 

1.17090706 

15 

1.17122965 

In  many  instances,  we  want  to  compute  a  root  of  the  equation  x=f(x) 
to  within  a  certain  accuracy.  The  easiest,  and  most  efficient  way  of  accom¬ 
plishing  this  is  to  instruct  the  computer  to  terminate  the  program  at  k  =/  if 
Xj+\  agrees  with  Xj  within  the  prescribed  accuracy. 


Exercises 


1.  Let 

(a) 

(b) 

(c) 


2.  (a) 


(b) 
3.  (a) 


(b) 

4.  (a) 


tj  be  the  unique  root  of  Equation  (5). 

Let  x0  =  77 /4.  Show  that  20  iterations  are  required  to  find  tj  to  8  significant 
decimal  places. 

Let  *0  =  77/2.  Show  that  20  iterations  are  required  to  find  tj  to  8  decimal 
places. 

Let  x0  =  37t/8.  Show  that  16  iterations  are  required  to  find  tj  to  8  decimal 
places. 

Determine  suitable  values  of  *0  so  that  the  iterates  x„ ,  defined  by  the  equa¬ 
tion 

will  converge  to  V2  . 

Choose  *o=  1.4.  Show  that  14  iterations  are  required  to  find  V2  to  8  signifi¬ 
cant  decimal  places.  (V2  =  1.41421356  to  8  significant  decimal  places.) 

Determine  suitable  values  of  x0  so  that  the  iterates  x„,  defined  by  the  equa¬ 
tion 

*n+  1  Xn  ~  To(’*/i  — 

will  converge  to  Vl  . 

Choose  *0=  1.4.  Show  that  30  iterations  are  required  to  find  V2  to  6  signifi¬ 
cant  decimal  places. 

Determine  a  suitable  value  of  a  so  that  the  iterates  xni  defined  by  the  equa¬ 
tion 

x0=\J 

will  converge  to  VT  . 

Find  VJ  to  6  significant  decimal  places. 
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5.  Let  rj  be  the  unique  root  of  the  equation  x  =  14-  \  arc  tan  x.  Find  rj  to  5  signifi¬ 
cant  decimal  places. 

6.  (a)  Show  that  the  equation  2  —  x  =  (lnx)/4  has  a  unique  root  x  =  rj  in  the  inter¬ 

val  0<  x  <  oo. 

(b)  Let 

xn  +  i  2  (In  /4,  n  0, 1,2,... 

Show  that  1  <  xn  <  2  if  1  <  x0  <  2. 

(c)  Prove  that  x„^tj  as  oo  if  1  <  x0  <  2. 

(d)  Compute  rj  to  5  significant  decimal  places. 

7.  (a)  Show  that  the  equation  x  =  cos*  has  a  unique  root  x  =  tj  in  the  interval  0  <  x 

<  1. 

(b)  Let  xw  +  1  =  cosx„,  «  =  0, 1,2,...,  with  0<xo<l.  Show  that  0<xw<l.  Con¬ 
clude,  therefore,  that  x„— as  n-+  oo. 

(c)  Find  r]  to  5  significant  decimal  places. 


LI  LI  Newton's  method 


The  method  of  iteration  which  we  used  to  solve  the  equation  x=/(x)  can 
also  be  used  to  solve  the  equation  g(x)  =  0.  To  wit,  any  solution  x  =  tj  of 
the  equation  g(x)  =  0  is  also  a  solution  of  the  equation 

x=f(x)  =  x-g(x),  (1) 

and  vice-versa.  Better  yet,  any  solution  x  =  rr\  of  the  equation  g(x)  =  0  is 
also  a  solution  of  the  equation 

,,  x  g(*) 

x=f(x)  =  x-  — —  (2) 

h{x) 

for  any  function  h(x).  Of  course,  h(x)  must  be  unequal  to  zero  for  x  near 

Equation  (2)  has  an  arbitrary  function  h(x)  in  it.  Let  us  try  and  choose 
h(x)  so  that  (i)  the  assumptions  of  Theorem  4,  Section  1.11  are  satisfied, 
and  (ii)  the  iterates 

g(*o)  s(*i) 

Xq,  Xj  Xq  ,  x2  Xj  v  >••• 

h(x0)  h{xx) 

converge  as  “rapidly  as  possible”  to  the  desired  root  tj.  To  this  end,  we 
compute 


g'(x)  /»'(*)  g(*) 

h(x)  h2(x ) 


and  observe  that 


f(v)= 1- 


g'(v) 

h(v)  ■ 
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This  suggests  that  we  set  h(x)  =  g'{x\  since  then  =  0.  Consequently, 
the  iterates  xn ,  defined  recursively  by  the  equation 


g(*») 

g'(Xn)  ’ 


n  —  0, 1,2, ... 


will  converge  to  rj  if  the  initial  guess  x0  is  sufficiently  close  to  rj.  (If  /'(t?)  = 
0,  then  |/'(x)|<X<l  for  \x  —  r)\  sufficiently  small.)  Indeed,  the  choice  of 
h(x)=f'(x)  is  an  optimal  choice  of  h(j c),  since  the  convergence  of  xn  to  rj 
will  be  extremely  rapid.  This  follows  immediately  from  the  fact  that  the 
number  A  in  Equation  4,  Section  1.11  can  be  taken  arbitrarily  small,  as  xn 
approaches  rj. 

The  iteration  scheme  (3)  is  known  as  Newton’s  method  for  solving  the 
equation  g(x)  =  0.  It  can  be  shown  that  if  g(y)  =  0,  and  x0  is  sufficiently 
close  to  rj ,  then 


for  some  positive  constant  c.  In  other  words,  the  error  we  make  in  ap¬ 
proximating  rj  by  xn+l  is  proportional  to  the  square  of  the  error  we  make 
in  approximating  rj  by  xn.  This  type  of  convergence  is  called  quadratic  con¬ 
vergence,  and  it  implies  that  the  iterates  xn  converge  extremely  rapidly  to 
rj.  In  many  instances,  only  five  or  six  iterations  are  required  to  find  rf  to 
eight  or  more  significant  decimal  places. 


Example  1.  Use  Newton’s  method  to  compute  V2  . 

Solution .  The  square  root  of  two  is  a  solution  of  the  equation 

g(x)  =  x2  — 2  =  0. 

Hence,  Newton’s  scheme  for  this  problem  is 


az  =  0,  1,2, ... . 


Sample  Pascal  and  Fortran  programs  to  compute  the  first  N  iterates  of  an 
initial  guess  x0  are  given  below. 


Pascal  Program 

Program  Newton  (input,  output); 
var 

X:  array [0 . .  199]  of  real; 
k,  N:  integer; 

begin 

readln(X[0],  N); 
page; 
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writeln(4N,:4,  ‘X[N]’:14); 
for  k  :=0  to  N  do 
begin 

writeln(K:4,  ‘  :4,  X[k]:17:9); 
X[k  + 1]  :=X[k]/2  +  1/X[k]; 
end; 

end. 


Table  1 


n 

n 

x„ 

0 

1.4 

3 

1.41421356 

1 

1.41428571 

4 

1.41421356 

2 

1.41421356 

5 

1.41421356 

Fortran  Program 


We  need  only  replace  the  instructions  for  computing  X(1)  and  X(K)  in  the 
Fortran  program  of  Section  1.11  by 


X(1)  =  (X0/2)  +  1/X0 

and 

X(K)  =  (X(K  - 1  )/2)  + 1  /X(K  - 1 ) 

We  ran  these  programs  for  x0=  1.4  and  N  =  5,  and  the  results  are  given  in 
Table  1.  Notice  that  Newton’s  method  requires  only  2  iterations  to  find 

V2  to  eight  significant  decimal  places. 

See  also  C  Program  2  in  Appendix  C  for  a  sample  C  program. 


Example  2.  Use  Newton’s  method  to  find  the  impact  velocity  of  the  drums 
in  Section  1.7. 

Solution.  The  impact  velocity  of  the  drums  satisfies  the  equation 


/  x  ,  300 eg  w-B , 
*(»)  =  ®  +  — jjy-  +  ~^"ln 


W-B-cv 

W-B 


=  0 


where 

c  =  0.08,  g  =  32.2,  W=  527.436,  and  5  =  470.327. 

Setting  a  =  (W—  B)/ c  and  */=300cg/  W  puts  (5)  in  the  simpler  form 

g  (t>)  =  v  +  d  +  a  ln(  1  —  v/ a)  =  0.  (6) 

Newton’s  iteration  scheme  for  this  problem  is 
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v 


n+  1 


— — — -  =  V„  H - ; - 

g(cn)  vn/ a 


a  —  v 


=  - ^ -  [vn  +  d+  aln(l  —  vn/a)]9  w  =  0, 1,2,.. 


Sample  Pascal  and  Fortran  programs  to  compute  the  first  N  iterates  of  v0 
are  given  below. 


Pascal  Program 

Program  Newton  (input,  output); 

const 
c  =0.08; 
g  =32.2; 

W  =  527.436; 

B  =470.327; 

var 

V:  array [0. .  199]  of  real; 
a,  d:  real; 
k,  N:  integer; 

begin 

readln(V[0],  N); 
a:=(W  — B)/c; 
d  :=300*c*g/W; 
page; 

writeln(‘N’:4,  ‘V[N]’:14); 
for  k  :=0  to  N  do 
begin 

writeln(K:4,  ‘  ’:4,  V[k]:17:9); 

V[k+1]  :=V[k]  +  ((a  — V[k])/V[k]) 

*  (V[k]H-d-ha*  ln(1  —  (V[k]/a))); 

end; 

end. 


Fortran  Program 

Change  every  X  to  V,  and  replace  the  instructions  for  X(1)  and  X(K)  in  the 
Fortran  program  of  Section  1.11  by 

V(1 )  =  V0  +  (( A  -  V0) /V0)  *  (' V0  +  D  +  A  *  A  LOG  (1  -  (V0  /  A ))) 
and 
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V(K)  =  V(K  —  1 )  +  ((A  —  V(K  —  1 )) /V(K  —  1  ))*  (V(K-1)  +  D 
+  A*  A  LOG  (1  — ( V(K  —  1 ) /A))) 

(Before  running  these  programs,  of  course,  we  must  instruct  the  computer 
to  evaluate  the  constants  a  =  (W—  B)/ c  and  rf=300  cg/W.) 

As  was  shown  in  Section  1.7,  u0  =  45.7  is  a  very  good  approximation  of 
v.  We  set  v0  =  45.1  in  the  above  programs,  and  the  iterates  vn  converged 
very  rapidly  to  u  =  45.1  ft/s.  Thus,  the  drums  can  indeed  break  upon  im¬ 
pact. 

In  general,  it  is  not  possible  to  determine,  a  priori,  how  many  iterations 
will  be  required  to  achieve  a  certain  accuracy.  In  practice,  we  usually  take 
N  very  large,  and  instruct  the  computer  to  terminate  the  program  if  one  of 
the  iterates  agrees  with  its  predecessor  to  the  desired  accuracy. 

See  also  C  Program  3  in  Appendix  C  for  a  sample  C  program. 


Exercises 

1.  Show  that  the  iterates  xn  defined  by  (4)  will  converge  to  V2  if 

V273  <*0<V2  +(V2  -  VV3  ). 

2.  Use  Newton’s  method  to  find  the  following  numbers  to  8  significant  decimal 
places,  (a)  V3  ,  (b)  V5  ,  (c)  V7 . 

3.  The  number  m  is  a  root  of  the  equation 

tan-r  -  cot  —  =0. 

4  4 

Use  Newton’s  method  to  find  m  to  8  significant  decimal  places. 

Show  that  each  of  the  following  equations  has  a  unique  solution  in  the 
given  interval,  and  use  Newton’s  method  to  find  it  to  5  significant  decimal 
places. 

4.  2x  —  tanx  =  0;  ‘tt<x<3‘jt/2  5.  \  —  jc  + jsin;t=0;  j  < * <  1 

6.  lnx+(x+ 1)3 =0;  0<x<l  7.  iVx  =cos^;  0<jc<1 

8.  (x—l)2  —  jex  =  0;  0<x<l  9.  x-e~*2=l;  0<x<2. 

1.12  Difference  equations,  and  how  to  compute 
the  interest  due  on  your  student  loans 

In  Sections  1.13-1.16  we  will  construct  various  approximations  of  the  solu¬ 
tion  of  the  initial-value  problem  dy/dt=f(t,y),y(t0)=y0.  In  determining 
how  good  these  approximations  are,  we  will  be  confronted  with  the  follow- 
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ing  problem:  How  large  can  the  numbers  El9...9EN  be  if 

En  +  l  <AEn  +  B,  *  =  0,1,...,A-1  (1) 

for  some  positive  constants  A  and  B ,  and  £’o=0?  This  is  a  very  difficult 
problem  since  it  deals  with  inequalities ,  rather  than  equalities.  Fortunately, 
though,  we  can  convert  the  problem  of  solving  the  inequalities  (1)  into  the 
simpler  problem  of  solving  a  system  of  equalities.  This  is  the  content  of  the 
following  lemma. 

Lemma  1.  Let  El9...9EN  satisfy  the  inequalities 

En  +  \  <AEn  +  B,  E0  =  0 

for  some  positive  constants  A  and  B.  Then ,  En  is  less  than  or  equal  to  yn, 
where 

yn+\=Ayn+B,  >’o=°-  (2) 

Proof.  We  prove  Lemma  1  by  induction  on  n.  To  this  end,  observe  that 
Lemma  1  is  obviously  true  for  n  —  0.  Next,  we  assume  that  Lemma  1  is  true 
for  n=j.  We  must  show  that  Lemma  1  is  also  true  for  /i=y+  1.  That  is  to 
say,  we  must  prove  that  Ej<  y}  implies  Ej+l  <  yJ+v  But  this  follows  im¬ 
mediately,  for  if  Ej  <  yj  then 

Ej+ 1  <AEj  +  B<  Ayj  +  B  =yj+v 

By  induction,  therefore,  En  <  yn,  n  =  0, 1 , . . . ,  N.  □ 

Our  next  task  is  to  solve  Equation  (2),  which  is  often  referred  to  as  a  dif¬ 
ference  equation.  We  will  accomplish  this  in  two  steps.  First  we  will  solve 
the  “simple”  difference  equation 

yn+i=y«  +  Bn’  yo=yo ■  (3) 

Then  we  will  reduce  the  difference  equation  (2)  to  the  difference  equation 
(3)  by  a  clever  change  of  variables. 

Equation  (3)  is  trivial  to  solve.  Observe  that 

yi~yo  =Bo 
y i  y i  =B\ 

y  n— \  yn  — 2  Bn  —  2 
yn  y n—\~  Bn  —  1  * 

Adding  these  equations  gives 

(yn-yn-\)+(yn-\-yn-2)  +  •••  +(>'i“>;o)==j8o+5i‘f  •••  +  Bn-v 

Hence, 

n  —  1 

yn=yo+Bo+  •••  +  Bn-i=yo+  2  Br 

j-o 
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Next,  we  reduce  the  difference  equation  (2)  to  the  simpler  equation  (3) 
in  the  following  clever  manner.  Let 


yn 


A"’ 


n  =  0, 


Then,  zn+l  =yn+l/A n+l.  Buty„+1  =  Ay„  +  B.  Consequently, 

y„  B  B 


Zn  +  1 


H - — —  =  z  +  — 

A n  A n+ 1  n  A n+ 1 


Therefore, 


and 


yn=Anz„=Any0  + 


B 


A-  1 


(An~l). 


Finally,  returning  to  the  inequalities  (1),  we  see  that 

En<~-^(An-\),  n  =  l,2,...,N. 


(4) 

(5) 


While  collecting  material  for  this  book,  this  author  was  approached  by  a 
colleague  with  the  following  problem.  He  had  just  received  a  bill  from  the 
bank  for  the  first  payment  on  his  wife’s  student  loan.  This  loan  was  to  be 
repaid  in  10  years  in  120  equal  monthly  installments.  According  to  his 
rough  estimate,  the  bank  was  overcharging  him  by  at  least  20%.  Before 
confronting  the  bank’s  officers,  though,  he  wanted  to  compute  exactly  the 
monthly  payments  due  on  this  loan. 

This  problem  can  be  put  in  the  following  more  general  framework. 
Suppose  that  P  dollars  are  borrowed  from  a  bank  at  an  annual  interest  rate 
of  R  %.  This  loan  is  to  be  repaid  in  n  years  in  equal  monthly  installments  of 
x  dollars.  Find  x. 

Our  first  step  in  solving  this  problem  is  to  compute  the  interest  due  on 
the  loan.  To  this  end  observe  that  the  interest  I x  owed  when  the  first  pay¬ 
ment  is  due  is  Ix=(r/\2)P,  where  r  =  R/ 100.  The  principal  outstanding 
during  the  second  month  of  the  loan  is  (x  —  7j)  less  than  the  principal  out¬ 
standing  during  the  first  month.  Hence,  the  interest  /2  owed  during  the  sec¬ 
ond  month  of  the  loan  is 


h-h-Jjix-h)- 

Similarly,  the  interest  L+i  owed  during  the  (j  +  l)st  month  is 


93 


1  First-order  differential  equations 


(6) 


where  Ij  is  the  interest  owed  during  the  yth  month. 
Equation  (6)  is  a  difference  equation  for  the  numbers 

I  =  —  P  I  I 
1  i  |2  r ' J*2’  *  *  *  9 1 12w* 

Its  solution  (see  Exercise  4)  is 


Hence,  the  total  amount  of  interest  paid  on  the  loan  is 


12  n 


/=/l  +  /2+  ...  +/i2w=  2  Ij 


j=  1 
7-1 


12 n  12  n 

7-1  7-1 


Now, 


7-1  L 


Therefore, 


7=P 


+  \2nx  — 


\2x 


(>^r- 


/  -  \12* 

=  \2nx-P  +  P\\  +  -^}  - 


12.x 


But,  12 nx  —  P  must  equal  /,  since  12hx  is  the  amount  of  money  paid  the 
bank  and  P  was  the  principal  loaned.  Consequently, 


< 


p  i+ 


6) 


12  n 


\2x 


=  0 


and  this  equation  implies  that 


12 


(1+n) 


12  n 


X  — 


( 


\  12« 

1  +  T2) 


(7) 


Epilog,  Using  Equation  (7),  this  author  computed  x  for  his  wife’s  and 
his  colleague’s  wife’s  student  loans.  In  both  cases  the  bank  was  right — to 
the  penny. 
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Exercises 

1.  Solve  the  difference  equation  yn + x  —  —  lyn  +  2 ,y0  =  1 . 

2.  Find  y37  if  ^+1  =  3^+ 1,^0  =  0,  n  =  0, 1 . 36. 

3.  Estimate  the  numbers  E0,EX,...,EN  if  E0  =  0  and 

(a)  En  +  l<  3En  +  1,  w  =  0,  l,...,iV—  1; 

(b)  En+ 1  < 2En  +  2,  /i  =  0, 1,...,JV  —  1. 

4.  (a)  Show  that  the  transformation  yj  =  Ij+x  transforms  the  difference  equation 

^+i"(1  +  J2  )Ij~  l2x 9  Il  =  l2P 

into  the  difference  equation 

yj+ i“(1  +  nX'  •Vo==  l2P' 

(b)  Use  Equation  (4)  to  find  yJ_lssIj. 

5.  Solve  the  difference  equation  yn+ x  =  anyn  +  bn,  yx  =  a.  Hint :  Set  zx  =yx  and 
z„  —yn/ax . . .an_  x  for  n  >  2.  Observe  that 

^/i  + 1  ^/» 

zrt+i  = - - - 1 - 

Cl  j . . .  Qn  Cl  1 . .  •  Qn  Cl  1 . . .  dn 

4.  K 

=  z„  H - . 

n  ax...an 

Hence,  conclude  that  z„  =  zx  +  Ej~xbj/ax . . . ay 

6.  Solve  the  difference  equation yn+x  —  nyn=\  —  n, yx=2. 

7.  Find 725  if y\~  1  and  {n  +  \)yn+x  —  nyn  =  2nf  n=  1,...,24. 

8.  A  student  borrows  P  dollars  at  an  annual  interest  rate  of  R%.  This  loan  is  to  be 
repayed  in  n  years  in  equal  monthly  installments  of  *  dollars.  Find  x  if 

(a)  P  =  4250,  R  =  3,  and  n  =  5; 

(b)  P  =  5000,  R  =  l,  and  «=  10. 

9.  A  home  buyer  takes  out  a  $30,000  mortgage  at  an  annual  interest  rate  of  9%. 
This  loan  is  to  be  repaid  over  20  years  in  240  equal  monthly  installments  of  x 
dollars. 

(a)  Compute  x. 

(b)  Find  x  if  the  annual  interest  rate  is  10%. 

10.  The  quantity  supplied  of  some  commodity  in  a  given  week  is  obviously  an  in¬ 
creasing  function  of  its  price  the  previous  week,  while  the  quantity  demanded 
in  a  given  week  is  a  function  of  its  current  price.  Let  Sj  and  Dj  denote,  respec¬ 
tively,  the  quantities  supplied  and  demanded  in  the  yth  week,  and  let  Pj  denote 
the  price  of  the  commodity  in  the  yth  week.  We  assume  that  there  exist  positive 
constants  a ,  b ,  and  c  such  that 

Sj  =  aPj  _  x  and  Dj  —  b  —  cPy 

(a)  Show  that  Pj  =  b/(a  +  c)  +  (—  a/c)  J(P0  —  b/(a  +  c)),  if  supply  always 
equals  demand. 
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(b)  Show  that  Pj  approaches  b/{a  +  c)  as  j  approaches  infinity  if  a/c<  1. 

(c)  Show  that  P  =  b/(a  +  c)  represents  an  equilibrium  situation.  That  is  to  say, 
if  supply  always  equals  demand,  and  if  the  price  ever  reaches  the  level 
b/(a  +  c),  then  it  will  always  remain  at  that  level. 


1.13  Numerical  approximations;  Euler’s  method 

In  Section  1.9  we  showed  that  it  is  not  possible,  in  general,  to  solve  the  ini¬ 
tial-value  problem 

y(t0)=y0.  (l) 

Therefore,  in  order  that  differential  equations  have  any  practical  value  for 
us,  we  must  devise  ways  of  obtaining  accurate  approximations  of  the  solu¬ 
tion  y{t)  of  (1).  In  Sections  1.13-1.16  we  will  derive  algorithms,  which  can 
be  implemented  on  a  digital  computer,  for  obtaining  accurate  approxima¬ 
tions  of  >>(/). 

Now,  a  computer  obviously  cannot  approximate  a  function  on  an  entire 
interval  t0  <  t  <  t0  +  a  since  this  would  require  an  infinite  amount  of  infor¬ 
mation.  At  best  it  can  compute  approximate  values  yx,...,yN  of  y(t)  at  a 
finite  number  of  points  However,  this  is  sufficient  for  our  pur¬ 

pose  since  we  can  use  the  numbers  y X9...9yN  to  obtain  an  accurate  ap¬ 
proximation  of  y(t)  on  the  entire  interval  t0  <  t  <  r0  +  a.  To  wit,  let  y(t)  be 
the  function  whose  graph  on  each  interval  [{,-,{,•+ J  is  the  straight  line  con¬ 
necting  the  points  (tj9yj)  and  (fy+iJ'y+i)  (see  Figure  1).  We  can  express  y{t) 
analytically  by  the  equation 

y  (t)=yJ+\{t-  tj)(yj+ 1  -yj)>  h  <  /  <  tj+i. 
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If  y(t)  is  close  to y{t)  at  t  =  ty  that  is,  if  yj  is  close  to y(tj),  and  if  tj+x  is 
close  to  tp  then  y  (7)  is  close  toy(/)  on  the  entire  interval  tj<  t<  tJ+x.  This 
follows  immediately  from  the  continuity  of  bothy (7)  andy(/).  Thus,  we 
need  only  devise  schemes  for  obtaining  accurate  approximations  ofy(/)  at 
a  discrete  number  of  points  tx,...,tN  in  the  interval  t0 <  t  <  tQ+  a.  For  sim¬ 
plicity,  we  will  require  that  the  points  tx,...,tN  be  equally  spaced.  This  is 
achieved  by  choosing  a  large  integer  N  and  setting  tk  =  t0+  k(a/N),  k  = 
1, . . . ,  N.  Alternately,  we  may  write  tk+x  =  tk  +  h  where  h  =  a/ N. 

Now,  the  only  thing  we  know  about y(t)  is  that  it  satisfies  a  certain  dif¬ 
ferential  equation,  and  that  its  value  at  t  =  t0  is  y0.  We  will  use  this  infor¬ 
mation  to  compute  an  approximate  value yx  of  y  at  t=tx  =  t0+  h.  Then,  we 
will  use  this  approximate  value  yx  to  compute  an  approximate  value  y2  of  y 
at  /  =  t2  —  tx  +  h,  and  so  on.  In  order  to  accomplish  this  we  must  find  a  the¬ 
orem  which  enables  us  to  compute  the  value  of  y  at  t  =  tk  +  h  from  the 
knowledge  of  y  at  t  =  tk.  This  theorem,  of  course,  is  Taylor’s  Theorem, 
which  states  that 


,  ,^(4)  ,  h2  d2y(‘k )  , 
y{tk  +  h)  -y  (4)  +  h—^~  +  + 


•  •  •  • 


Thus,  if  we  know  the  value  of  y  and  its  derivatives  at  t  =  tk ,  then  we  can 
compute  the  value  of  y  at  t  =  tk  +  h .  Now,  y{t)  satisfies  the  initial-value 
problem  (1).  Hence,  its  derivative,  when  evaluated  at  t  =  tk,  must  equal 
f(tk*yOk))-  Moreover,  by  repeated  use  of  the  chain  rule  of  partial  dif¬ 
ferentiation  (see  Appendix  A),  we  can  evaluate 

d2y(Q  [  a/  a/ 

dt 2  [3/  J  9y 

and  all  other  higher-order  derivatives  ofy(/)  at  t  =  tk.  Hence,  we  can  re¬ 
write  (2)  in  the  form 


(^y(Q) 


y(tk+i)=y(tk)+W(tk,y(tk)) 


+  £ 
2! 


V.fV 

3 1  J  a y 


The  simplest  approximation  ofy(4  +  1)  is  obtained  by  truncating  the 
Taylor  series  (3)  after  the  second  term.  This  gives  rise  to  the  numerical 
scheme 

y\  =yo+hf{t0,y0),  y2=y\+W(ti>y\)> 

and,  in  general, 


yo=y(to)-  (4) 

Notice  how  we  use  the  initial-value  y0  and  the  fact  that  y{t)  satisfies  the 
differential  equation  dy /dt=f(t,y)  to  compute  an  approximate  value  yx  of 
y(t)  at  t  =  tx.  Then,  we  use  this  approximate  value  yx  to  compute  an  ap¬ 
proximate  value  y2  of  y(t)  at  t  =  t2,  and  so  on. 
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Equation  (4)  is  known  as  Euler* s  scheme.  It  is  the  simplest  numerical 

scheme  for  obtaining  approximate  values  y j, _ _ yN  of  the  solution  y{t)  at 

times  tN.  Of  course,  it  is  also  the  least  accurate  scheme,  since  we  have 
only  retained  two  terms  in  the  Taylor  series  expansion  for  >>(/).  As  we  shall 
see  shortly,  Euler’s  scheme  is  not  accurate  enough  to  use  in  many  prob¬ 
lems.  However,  it  is  an  excellent  introduction  to  the  more  complicated 
schemes  that  will  follow. 

Example  1.  Lety(/)  be  the  solution  of  the  initial-value  problem 

dy/dt  =  \  +  {y-tf,  >>(0)  =  |. 

Use  Euler’s  scheme  to  compute  approximate  values  of  y{t)  at  the 

points  tx  =  l/N,t2  =  2/N,...,tN  =  1. 

Solution.  Euler’s  scheme  for  this  problem  is 

^+i=>’a:  +  /i[1  +  (>'*:-4)2]>  k=0,l,...,N-l,  h  =  l/N 

with  y0  =  \.  Sample  Pascal  and  Fortran  programs  to  compute  yl9 . . . ,  yN  are 
given  below.  These  programs,  as  well  as  all  subsequent  programs,  have 
variable  values  for  t0 ,  y0,  a,  and  A,  so  that  they  may  also  be  used  to  solve 
the  more  general  initial-value  problem  dy/dt—  1  +(y  —  t)2,  y(t0)=y0  on 
any  desired  interval.  Moreover,  these  same  programs  work  even  if  we 
change  the  differential  equation;  if  we  change  the  function  f(t,y)  then  we 
need  only  change  line  12  in  the  Pascal  program  (and  line  1 1  in  the  C  program) 
and  the  expressions  for  Y(1)  and  Y(K)  in  Section  B  of  the  Fortran  program. 

Pascal  Program 
Program  Euler  (input,  output); 
var 

T,  Y:  array[0.  .999]  of  real; 
a,  h:  real; 
k,  N:  integer; 

begin 

readln(T[0],  Y[0],a,  N); 

h  :=a/N; 

page; 

for  k:=Q  to  N  — 1  do 
begin 

T[k+1]:==T[k]  +  h; 

Y[k  + 1]  := Y[k]  +  h  *  (1  +  (Y[k]-T[k])  *  (Y[k] -T[k])); 
end; 

writeln(‘T’:4,  ‘Y’:16); 
for  k  :=0  to  N  do 
writeln(T[k]:10:7,  ‘  ':2,  Y[k]:16:9); 


end. 
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Fortran  Program 


Section  A 
Read  in  data 


r 


DIMENSION  T(1 000),  Y(1 000) 
READ  (5, 10)  TO,  Y0,  A,  N 
FORMAT  (3F20.8, 15) 

H  =  A/N 


Section  B 
Do  computations 


l  20 


T(1)  =  T0  +  H 

Y(1)  =  Y0  +  H*(1  +  (Y0  -  TO)  *  *2) 

DO  20  K  =  2,N 

T(K)  =  T(K  — 1)  +  H 

Y(K)  =  Y(K  - 1 )  +  H  *  (1  +(Y(K  —  1) 

—  T(K  —  1))*  *2) 

CONTINUE 


Section  C 
Print  out 
results 


30 


WRITE  (6, 30)  TO,  Y0,  (T(J),  Y(J),  J  =  1 ,  N) 
FORMAT  (1  HI ,  3X,  1 HT,  4X,  1 HY,  /(I  H,  1 X, 
F10.7.2X,  F20.9/)) 

CALL  EXIT 
END 


See  also  C  Program  4  in  Appendix  C  for  a  sample  C  program. 

Table  1  below  gives  the  results  of  these  computations  for  a—  1,  N=  10,  t0— 
0,  andy0=  j.  All  of  these  computations,  and  all  subsequent  computations, 
were  carried  out  on  an  IBM  360  computer  using  16  decimal  places  ac¬ 
curacy.  The  results  have  been  rounded  to  8  significant  decimal  places. 


Table  1 


t 

y 

t 

y 

0 

0.5 

0.6 

1.29810115 

0.1 

0.625 

0.7 

1.44683567 

0.2 

0.7525625 

0.8 

1.60261202 

0.3 

0.88309503 

0.9 

1.76703063 

0.4 

1.01709501 

1 

1.94220484 

0.5 

1.15517564 

The  exact  solution  of  this  initial-value  problem  (see  Exercise  7)  is 

y(t)  =  t+\/{2-t). 

Thus,  the  error  we  make  in  approximating  the  value  of  the  solution  at  t  =  1 
by yw  is  approximately  0.06,  sincey(l)  =  2.  If  we  run  this  program  for  N  = 
20  and  A  =  40,  we  obtain  thaty20=  1.96852339  and  y 40=  1.9835109.  Hence, 
the  error  we  make  in  approximating  y(\)  by  y40  is  already  less  than  0.02. 
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Exercises 

Using  Euler’s  method  with  step  size  h  =  0.1,  determine  an  approximate 
value  of  the  solution  at  t  =  1  for  each  of  the  initial-value  problems  1-5.  Re¬ 
peat  these  computations  with  h  =  0.025  and  compare  the  results  with  the 
given  value  of  the  solution. 

t.  %  =  \  +  t~y,  y(0)=0;(y(t)=t) 

2.  ^=2 ty,  /(0)  =  2;  (y(t)  =  2el2) 

3. ^  =  1  +y2-t2,  y(0)=0;  (y(t)=t) 

4-  <lt=te~y  +  T77i'  /(0)=0;  (/(0=ln(l  +  <2)) 

Ul  1  +  t 

$.%  =  -\+2t+ — ,  /(0)=  1  ;(/(/)=  1  +  /2) 
dt  (1  +  t2f 

6.  Using  Euler’s  method  with  /z  =  7t/40,  determine  an  approximate  value  of  the 
solution  of  the  initial-value  problem 

^  =  2sec2r-(l  +y2),  >-(0)=0 

at  /  =  7t/4.  Repeat  these  computations  with  /j  =  tt/160  and  compare  the  results 
with  the  number  one  which  is  the  value  of  the  solution y(t)  =  tan/  at  t  —  m/ 4. 

7.  (a)  Show  that  the  substitution y  =  t  +  z  reduces  the  initial- value  problem  y'  =  1  + 

O'  —  02j>;(0)  =  0.5  to  the  simpler  initial- value  problem  z'  =  z2,  z(0)  =  0.5. 

(b)  Show  that  z(t)  =  1  /(2  - t).  Hence,  y(t)  =  t  + 1/(2  —  /). 


LI 3.1  Error  analysis  for  Euler's  method 

One  of  the  nice  features  of  Euler’s  method  is  that  it  is  relatively  simple  to 
estimate  the  error  we  make  in  approximating  y(tk)  by  yk.  Unfortunately, 
though,  we  must  make  the  severe  restriction  that  do  not  exceed  t0 

+  a,  where  a  is  the  number  defined  in  the  existence-uniqueness  theorem  of 
Section  1.10.  More  precisely,  let  a  and  b  be  two  positive  numbers  and 
assume  that  the  functions  /,  3//  dt,  and  df/dy  are  defined  and  continuous 
in  the  rectangle  t0  <  /  <  /0  +  a,  y0  —  b  <  y  <  y0  +  b.  We  will  denote  this  rect¬ 
angle  by  R.  Let  M  be  the  maximum  value  of  \f(t,y)\  for  (t9y)  in  R,  and  set 
a  =  min (a,b/ M).  We  will  determine  the  error  committed  in  approximating 

y(Q  by  y»  for  **<  *o+«- 

To  this  end  observe  that  the  numbers  satisfy  the  difference 

equation 

yk+ 1  y k hf (tk>y k)>  ^  o,i,...,tv  i 

while  the  numbers y(t0),y(tf,...,y(tN)  satisfy  the  difference  equation 
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h 


y  ( 4 + 1 )  =y  C 4)  +  hf(  4>7  ( 4))  +  t 


9/9/ 

3/  7  9y 


(4*7(4))  (2) 


where  4  is  some  number  between  tk  and  tk+l.  Equation  (2)  follows  from 
the  identity 

d2y  9/  9/ 
dt 2  9/  *  dy 

and  the  fact  that 

/  ,v  /x  ,  4K0  h2  d  y(T) 

y(t  +  h)=y(t)  +  h — : —  + 


dr 


for  some  number  r  between  t  and  /  +  /l  Subtracting  Equation  (1)  from 
Equation  (2)  gives 

7(4+i)-7*+i-7(4)-7*  +  a[/(4^(4))-/(4»7*)] 

h2 


+ 


9 1  S  9 y 


(4- >'(4))- 


Next,  observe  that 


/(4.7(4))-/(4.7*)=  [7(4)~7&] 

where  y\k  is  some  number  between  y(tk)  and  yk.  Consequently, 

9/(4*%) 


l7(4+i)~7*+il  <  \y((k)~yk\  +  fl 


9y 


\y({k)-yk\ 


+ 


y.,y 

9 1  J  dy 


(4*7(4)) 


In  order  to  proceed  further,  we  must  obtain  estimates  of  the  quantities 

(9/(4> ''?*))/ 9^  and  [(3//90+/(3//3y)K4»^(4))-  To  this  end  observe  that 
the  points  (4>.H4))  and  (4^)  in  rectangle  R .  (It  was  shown  in 
Section  1.10  that  the  points  (4^(4))  in  /i.  In  addition,  a  simple  induc¬ 
tion  argument  (see  Exercise  9)  shows  that  the  points  ( tk9yk )  all  lie  in  R.) 
Consequently,  the  points  (tk,rjk)  must  also  lie  in  R.  Let  L  and  D  be  two 
positive  numbers  such  that 


and 


max 

(t,y)  in  R 


M. 

dy 


<L 


max 

( t,y )  in  R 


y  ,v 

9 1  J  dy 


<  D. 


Such  numbers  always  exist  if  /,  df/dt,  and  9//9y  are  continuous  in  R. 
Then, 


r%L 2 

17(4+1)  -7*+il  <  \y(tk) -yk  I  +  ^b(4)  -7*1  +  —  •  (3) 
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Now,  set  Ek  =  \y(tk)—yk\,  k  =  0,l,...,N.  The  number  Ek  is  the  error  we 
make  at  the  kth  step  in  approximating  y(tk)  by  yk.  From  (3) 

Ek+l<(\  +  hL)Ek+~,  k  =  0,l,...,N—l.  (4) 

Moreover,  E0  =  0  since  y(t0)=y0.  Thus,  the  numbers  E0,EV...,EN  satisfy 
the  set  of  inequalities 

Ek+\<tAEk  +  B,  Eq  =  0 

with  A  =  1  +  hL  and  B  —  Dh2/2.  Consequently,  (see  Section  1.12) 

Ek<^(Ak-\)=^{(\  +  hL)k-\].  (5) 

We  can  also  obtain  an  estimate  for  Ek  that  is  independent  of  k.  Observe 
that  1  +  hL  <  ehL.  This  follows  from  the  fact  that 


Therefore, 


ehL=  l  +  hL  + 


(hL? 

2! 


+  .  .  . 


=  (1  +  hL)  +  “something  positive”. 


Finally,  since  kh  <  a,  we  see  that 


Ek< 


Dh 
2  L 


Equation  (6)  says  that  the  error  we  make  in  approximating  the  solution 
y(t)  at  time  t  =  tk  by  yk  is  at  most  a  fixed  constant  times  h .  This  suggests,  as 
a  rule  of  thumb,  that  our  error  should  decrease  by  approximately  \  if  we 

decrease  h  by  We  can  verify  this  directly  in  Example  1  of  the  previous 
section  where  our  error  at  /  =  1  for  h  =  0.1,  0.05,  and  0.025  is  0.058,  0.032, 
and  0.017  respectively. 


Example  1.  Lety(/)  be  the  solution  of  the  initial-value  problem 


dy  t2+y 2 

di=  2  ’ 


7(0)  =  0. 


(a)  Show  that y(0  exists  at  least  for  0  <  t  <  1,  and  that  in  this  interval,  —  1 
<  y(t)  <  1. 

(b)  Let  N  be  a  large  positive  integer.  Set  up  Euler’s  scheme  to  find  ap¬ 
proximate  values  of  y  at  the  points  tk  =  k/ N,  k =0, 1, . . . ,N. 

(c)  Determine  a  step  size  h  =  l/N  so  that  the  error  we  make  in  approxi¬ 
mating  y  (4)  by  yk  does  not  exceed  0.0001. 

Solution .  (a)  Let  R  be  the  rectangle  0  <  /  <  1,  - 1  <  y  <  1.  The  maximum 
value  that  (t2+y2)/2  achieves  for  (t,y)  in  it  is  1.  Hence,  by  the  exist- 
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ence-uniqueness  theorem  of  Section  1.10,  y  (/)  exists  at  least  for 

0</<  a  =  min  ( 1 ,  {)  =  1, 
and  in  this  interval,  —  1  <  y  <  1. 


yk+\=yk+h 


*l+y2k 


2 


with  y0  =  0.  The  integer  k  runs  from  0  to  N—  1. 
(c)  Let  f(t,y)  =  (t2+y2)/2,  and  compute 


3/  3/  .3/  y  /  2  2\ 

and  +>,)- 

From  (6),  \y(tk)-yk\  <  (Dh/2L)(eL-  1)  where  L  and  D  are  two  positive 
numbers  such  that 

max  |y|<L 
(t,y)  in  R 


and 


max 

0,y)  in  R 


y 


t+lr(t2+y2) 


^  D. 


Now,  the  maximum  values  of  the  functions  \y\  and  \t  +  (y /2)(t2+y2)\  for 
( t,y )  in  R  are  clearly  1  and  2  respectively.  Hence, 


\y(tk)-yk\<  y(e-l)  =  Me-l). 


This  implies  that  the  step  size  h  should  be  smaller  than  0.0001/(e—  1). 
Equivalently,  N  should  be  larger  than  (e—  1 )  1 04  =  17,183.  Thus,  we  must 
iterate  the  equation 

1 

yk+ 1  yk+  2(17,183) 


17,183  times  to  be  sure  thaty(l)  is  correct  to  four  decimal  places. 


Example  2.  Lety(/)  be  the  solution  of  the  initial-value  problem 

(a)  Show  that  y  (7)  exists  at  least  for  0<  t  <  1,  and  that  in  this  interval,  —  1 
<  y  <  3. 

(b)  Let  TV  be  a  large  positive  integer.  Set  up  Euler’s  scheme  to  find  ap¬ 
proximate  values  of  y(t)  at  the  points  tk  =  k/N ,  k  =  0, 1,...,TV. 

(c)  Determine  a  step  size  h  so  that  the  error  we  make  in  approximating 
y(tk)  by  yk  does  not  exceed  0.0001. 

Solution. 

(a)  Let  R  be  the  rectangle  0  <  f  <  1,  \y—  1|  <  2.  The  maximum  value  that  t 2 
+  e~y  achieves  for  (t,y)  in  R  is  2.  Hence,  y(t)  exists  at  least  for  0</< 
min(  1,2/2)=  1,  and  in  this  interval,  —  1  <  y  <  3. 
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0>)  yk+i=yk  +  h(tk2  +  e  ykl)=yk  +  ( l/N)[(k/N)2  +  e  with  y0=l.  The 

integer  k  runs  from  0  to  TV  —  1. 

(c)  Let  f(t,y)=  t2+  e~y  and  compute 


V 

dy 


=  2t  —  2y(t2  +  e~y2)e~y2. 


From  (6),  |y(4)  —yk\  <  (Dh/2L)(eL—  1)  where  L  and  D  are  two  positive 
numbers  such  that 

max  I  —  2 ye  ~y2\  <  L 

( t,y )  in  R 


and 


max  \2t  —  2y(t2  +  e  y2)e  y2\<D. 

( t,y )  in  R 

Now,  it  is  easily  seen  that  the  maximum  value  of  \2ye~y2\  for  —  1  <  y  <  3  is 

V2/e  .  Thus,  we  take  L=  V 2 / e  .  Unfortunately,  though,  it  is  extremely 
difficult  to  compute  the  maximum  value  of  the  function 


\2t  —  2y(t2+  e~y2)e~y2\ 

for  ( t,y )  in  R.  However,  we  can  still  find  an  acceptable  value  D  by  observ¬ 
ing  that  for  ( t,y )  in  R , 

max|2/  —  2v(72  +  e~y2)e~y2\  <  max|2/|  +  max|2y  (t2  +  e~y2)e~y2\ 

<  max|2f|  +max|2ye"-);2|  Xmax(r2+  e~y2) 

=  2  +  2V2]~e  =2(1  +  V2/e). 

Hence,  we  may  choose  D  =  2(1  +  V  2 Je  ).  Consequently, 


2(l  +  V2/e)h[eVVe  -l] 


2V2/e 

This  implies  that  the  step  size  h  must  be  smaller  than 


V2A  0.0001 

1  +  \[2fe  -  1 


Examples  1  and  2  show  that  Euler’s  method  is  not  very  accurate  since 
approximately  20,000  iterations  are  required  to  achieve  an  accuracy  of  four 
decimal  places.  One  obvious  disadvantage  of  a  scheme  which  requires  so 
many  iterations  is  the  cost.  The  going  rate  for  computer  usage  at  present  is 
about  $1200.00  per  hour.  A  second,  and  much  more  serious  disadvantage, 
is  thaty*  may  be  very  far  away  fromy(^)  if  N  is  exceptionally  large.  To 
wit,  a  digital  computer  can  never  perform  a  computation  exactly  since  it 
only  retains  a  finite  number  of  decimal  places.  Consequently,  every  time 
we  perform  an  arithmetic  operation  on  the  computer,  we  must  introduce  a 
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“round  off”  error.  This  error,  of  course,  is  small.  However,  if  we  perform 
too  many  operations  then  the  accumulated  round  off  error  may  become  so 
large  as  to  make  our  results  meaningless.  Exercise  8  gives  an  illustration  of 
this  for  Euler’s  method. 


Exercises 


1.  Determine  an  upper  bound  on  the  error  we  make  in  using  Euler’s  method  with 
step  size  h  to  find  an  approximate  value  of  the  solution  of  the  initial-value  prob¬ 
lem 


dy  _  t2+y2 
Tt  2  ’ 


y(0)=i 


at  any  point  t  in  the  interval  [0,  §].  Hint :  Let  R  be  the  rectangle  0<f<l,0<y< 

2. 


2.  Determine  an  upper  bound  on  the  error  we  make  in  using  Euler’s  method  with 
step  size  h  to  find  an  approximate  value  of  the  solution  of  the  initial-value  prob¬ 
lem 

$  =  '-/,  ,H0)=0 

at  any  point  t  in  the  interval  [0, 1].  Hint:  Let  R  be  the  rectangle  0  <  /  <  1,  —  1  <  y 

<1. 

3.  Determine  an  upper  bound  on  the  error  we  make  in  using  Euler’s  method  with 
step  size  h  to  find  an  approximate  value  of  the  solution  of  the  initial-value  prob¬ 
lem 


%=t+ey,  /(0)=o 

at  any  point  t  in  the  interval  [0, 1  /(e+ 1)].  Hint  :  Let  R  be  the  rectangle  0  <  /  <  1, 
-l<y  <  1. 

4.  Determine  a  suitable  value  of  h  so  that  the  error  we  make  in  using  Euler’s 
method  with  step  size  h  to  find  an  approximate  value  of  the  solution  of  the  ini¬ 
tial-value  problem 

%=e'~y2’  y^=0 

at  any  point  t  in  the  interval  [0,  \/e]  is  at  most  0.0001.  Hint:  Let  R  be  the  rect¬ 
angle  0</<l,  —  1  <  y  <  1. 

5.  Determine  a  suitable  value  of  h  so  that  the  error  we  make  in  using  Euler’s 
method  with  step  size  h  to  find  an  approximate  value  of  the  solution  of  the  ini¬ 
tial-value  problem 

=  ?2  +  tan2_y,  ^(0)  =  0 

at  any  point  t  in  the  interval  [0,  |]  is  at  most  0.00001.  Hint:  Let  R  be  the  rectan- 
gle  0  — tt/4<  y  <  7t/4. 
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6.  Determine  a  suitable  value  of  h  so  that  the  error  we  make  in  using  Euler’s 
method  with  step  size  h  to  find  an  approximate  value  of  the  solution  of  the  ini¬ 
tial-value  problem 


cfy  _  1 

dt  1  +  t2+y2  ’ 


y(  0)=0 


at  any  point  t  in  the  interval  [0, 1]  is  at  most  0.0001.  Hint:  Let  R  be  the  rectangle 

0</<l,  —  1  <  y  <  1. 


7.  Let>>(/)  be  the  solution  of  the  initial- value  problem 

^ =f(t,y ),  r(0)=0. 

Suppose  that  \f(t,y)\  <  1,  |3//9y|  <  1,  and  |(9//3r)+/(9//3y)|  <2  in  the  rectan¬ 
gle  0  <  /  <  1,  —  \<y<\.  When  the  Euler  scheme 

yk+ 1  =yk + M(tk,yk\  h  = 

is  used  with  AT=  10,  the  value  of  y5  is  —  0.15[(-j£)5  —  1],  and  the  value  of  y6  is 
0.12[(-^)6—  1].  Prove  that  y(t)  is  zero  at  least  once  in  the  interval  (£,  f). 

8.  Let>>(/)  be  the  solution  of  the  initial-value  problem 

y'=f(t,y ),  yOo)=yo • 

Euler’s  method  for  finding  approximate  values  of  y(t)  is  yk+\  =yk  +  hf(tk,yk). 
However,  the  quantity  yk  +  hf(tk,yk)  is  never  computed  exactly:  we  always  in¬ 
troduce  an  error  ek  with  |  |  <  e .  That  is  to  say,  the  computer  computes  numbers 
yu?!’---’  such  that 

yk+\=yk+hf(tk>yk)+zk 


with y0 ^o-  Suppose  that  |3//9y|  <  L  and  |(0//3/)+/(3//3y)|  <  D  for  all  t  and 
>>• 

(a)  Show  that 

Ek+ 1  =  b(4+ 1) - 9k+  ll  <(1  +  hL)Ek  +  y h2  +  e 


(b)  Conclude  from  (a)  that 


Ek  < 


for  kh  <  a. 

(c)  Choose  h  so  that  the  error  Ek  is  minimized.  Notice  that  the  error  Ek  may  be 
very  large  if  h  is  very  small. 


9.  Let  y1,y2>*--  satisfy  the  recursion  relation 

yk+\=yk  +  hRtk,yk). 

Let  R  be  the  rectangle  t0  <  t  <  t0+ a, y0  —  b  <  y  <  y0+  b,  and  assume  that  \f{t,y)\ 
<  M  for  (t,y)  in  R.  Finally,  let  a  =  min (afb/ M). 

(a)  Prove  that  \yj—yo\  <  jhMf  as  long  as  jh  <  a.  Hint :  Use  induction. 

(b)  Conclude  from  (a)  that  the  points  (t^yj)  all  lie  in  R  as  long  as  j  <  a/h. 
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1.14  The  three  term  Taylor  series  method 

Euler’s  method  was  derived  by  truncating  the  Taylor  series 


y(tk+\)=y(tk)  +  hf{tk,y{tk)) 

h2 


+ 


at +J  a y 


(tk>y(tk))+  ••• 


after  the  second  term.  The  most  obvious  way  of  obtaining  better  numerical 
schemes  is  to  retain  more  terms  in  Equation  (1).  If  we  truncate  this  Taylor 
series  after  three  terms  then  we  obtain  the  numerical  scheme 


yk+i=yk+W(tk,yk)+ 


tL\V  +  fK' 

2  a  t  J  a y 


k  =  0,...,N—  1  (2) 


withy0=y(t0). 

Equation  (2)  is  called  the  three  term  Taylor  series  method .  It  is  obviously 
more  accurate  than  Euler’s  method.  Hence,  for  fixed  h,  we  would  expect 
that  the  numbers  yk  generated  by  Equation  (2)  are  better  approximations 
of  y(tk)  than  the  numbers  yk  generated  by  Euler’s  scheme.  This  is  indeed 
the  case,  for  it  can  be  shown  that  \y(tk)—yk\  is  proportional  to  h2  whereas 
the  error  we  make  using  Euler’s  method  is  only  proportional  to  h.  The 
quantity  h 2  is  much  less  than  h  if  h  is  very  small.  Tlius,  the  three  term 
Taylor  series  method  is  a  significant  improvement  over  Euler’s  method. 


Example  1.  Lety(7)  be  the  solution  of  the  initial-value  problem 

!  =  i+0 y(P)-s- 

Use  the  three  term  Taylor  series  method  to  compute  approximate  values  of 
y(t)  at  the  points  tk  =  k/N ,  k= 

Solution.  Let  f(t9y)  =  1  +  (y  —  t )2.  Then, 

|  +/-|  =  -  2{y  - 1)  +  2(y  - 1)  [  1  +  (y  - 1)2 ] : =  2(y  -  (?■ 

Hence,  the  three  term  Taylor  series  scheme  is 

yk+ 1  =yk + h[ 1 + ( yk  -  **)2] + *2(^*  -  hf 

with  h  =  l/N  and  y0  =  The  integer  k  runs  from  0  to  N  —  1.  Sample  Pascal 

and  Fortran  programs  to  compute  y  1? _ _ yN  are  given  below.  Again,  these 

programs  have  variable  values  for  t0 ,  y0 ,  a ,  and  N. 
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Pascal  Program 

Program  Taylor  (input,  output); 
var 

T,  Y:  array [0.. 999]  of  real; 
a,  h,  Temp:  real; 
k,  N:  integer; 

begin 

readln(T[0],  Y[0],  a,  N); 

h  :=a/N; 

page; 

for  k:=0  to  N  —  1  do 
begin 

Temp  :=Y[k]  — T[k]; 

T[k  +  1]:=T[k]  +  h; 

Y[k  +  1]  :=Y[k]  +  h*(1  +  Temp* Temp) 
4-  h  *  h  *  Temp  *  Temp  *  T emp; 

end; 

writeln(T’:4,  ‘Y’:16); 
for  k  :=0  to  N  do 
writeln (T[k]:10:7, 4  :2,  Y[k]:16:9); 

end. 


Fortran  Program 


Replace  Section  B  of  the  Fortran  program  in  Section  1.13  by  the  follow¬ 
ing: 


20 


T(1)  =  T0  +  H 

D2Y  =  H * (Y0  —  TO) *  *3 

Y(1)  =  Y0  +  H*(D2Y  +  1  +(Y0-T0)*  *2) 

D020  K  =  2,N 

T(K)  =  T(K  — 1)  +  H 

D2Y  =  H*(Y(K  — 1)— T(K  — 1))*  *3 

Y(K)  =  Y(K  —  1 ) 4- H * (D2Y  +  14-  (Y(K  —  1 )  —  T(K  —  1 )) *  *2) 

CONTINUE 


See  also  C  Program  5  in  Appendix  C  for  a  sample  C  program. 

Table  1  below  shows  the  results  of  these  computations  for  a  =  1,  N  =  10, 
ro  =  0,  andyo  =  0.5. 


Now  Euler’s  method  with  N=  10  predicted  a  value  of  1.9422  fory(l).  No¬ 
tice  how  much  closer  the  number  1.9957  is  to  the  correct  value  2.  If  we  run 
this  program  for  N= 20  and  A =40,  we  obtain  that  y20=  1.99884247  and 
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Table  1 


t 

y 

t 

y 

0 

0.5 

0.6 

1.31331931 

0.1 

0.62625 

0.7 

1.4678313 

0.2 

0.7554013 

0.8 

1.63131465 

0.3 

0.88796161 

0.9 

1.80616814 

0.4 

1.02456407 

1 

1.99572313 

0.5 

1.1660084 

yA0=  1.99969915.  These  numbers  are  also  much  more  accurate  than  the  val¬ 
ues  1.96852339  and  1.9835109  predicted  by  Euler’s  method. 

Exercises 

Using  the  three  term  Taylor  series  method  with  h  =  0.1,  determine  an  ap¬ 
proximate  value  of  the  solution  at  t  =  1  for  each  of  the  initial-value  prob¬ 
lems  1-5.  Repeat  these  computations  with  h  =  0.025  and  compare  the  re¬ 
sults  with  the  given  value  of  the  solution. 

1.  dy/dt=\  +  t-y,  y(0)  =  0;  (y(t)  =  0 

2.  dy / dt  =  2ty,  y( 0)  =  2;  (y(t)  =  2et2) 

3.  dy/dt=\+y2-t2,  y(0)=0;  (y(0=0 

4.  dy / dt=  te~y  + 1 /(1  +  /2),  y(0)  =  0;  (y(/)  =  ln(l  + 12)) 

5.  dy/dt=-\+2t+y2/{\  +  t2)\  y( 0)=1;  (y(0=l  +  /2) 

6.  Using  the  three  term  Taylor  series  method  with  h  =  ir/40 ,  determine  an  ap¬ 
proximate  value  of  the  solution  of  the  initial-value  problem 

^  =2sec2/  — (1  +y2),  /(0)  =  0 

at  t  =  7t/4.  Repeat  these  computations  with  /z  =  7t/160  and  compare  the  results 
with  the  number  one  which  is  the  value  of  the  solution  y{t)= tan  t  at  /  =  7t/4. 


1.15  An  improved  Euler  method 

The  three  term  Taylor  series  method  is  a  significant  improvement  over 
Euler’s  method.  However,  it  has  the  serious  disadvantage  of  requiring  us  to 
compute  partial  derivatives  of  f(t9y ),  and  this  can  be  quite  difficult  if  the 
function  /(/,y)  is  fairly  complicated.  For  this  reason  we  would  like  to  de¬ 
rive  numerical  schemes  which  do  not  require  us  to  compute  partial  deriva¬ 
tives  of  f(t,y).  One  approach  to  this  problem  is  to  integrate  both  sides  of 
the  differential  equation  y'=  f(t,y)  between  tk  and  tk  +  h  to  obtain  that 

^(4+i)=>;(^)+  r*+ */(',/ (o 

j‘k 
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This  reduces  the  problem  of  finding  an  approximate  value  of  y(tk+x)  to  the 
much  simpler  problem  of  approximating  the  area  under  the  curve  f(t,y(t)) 
between  tk  and  tk  +  h.  A  crude  approximation  of  this  area  is  hf(tk,y(tk)), 
which  is  the  area  of  the  rectangle  R  in  Figure  la.  This  gives  rise  to  the 
numerical  scheme 

yk+i-yk  +  hfitkJk) 

which,  of  course,  is  Euler’s  method. 

A  much  better  approximation  of  this  area  is 

which  is  the  area  of  the  trapezoid  T  in  Figure  lb.  This  gives  rise  to  the 
numerical  scheme 

yk+i^yk*  2  (2) 

However,  we  cannot  use  this  scheme  to  determine yk+l  fromy^  since yk+l 
also  appears  on  the  right-hand  side  of  (2).  A  very  clever  way  of  overcoming 
this  difficulty  is  to  replace  yk  +  l  in  the  right-hand  side  of  (2)  by  the  value 
yk  +  hf(tk9yk)  predicted  for  it  by  Euler’s  method.  This  gives  rise  to  the 
numerical  scheme 

yk+\=yk+^[f(tk’yk)+f(tk  +  h,yk  +  hf{tk,yk))],  y0=y(t0)-  (3) 

Equation  (3)  is  known  as  the  improved  Euler  method.  It  can  be  shown 
that  \y(tk)—yk\  is  at  most  a  fixed  constant  times  h2.  Hence,  the  improved 
Euler  method  gives  us  the  same  accuracy  as  the  three  term  Taylor  series 
method  without  requiring  us  to  compute  partial  derivatives. 


Example  1.  Let  y(t)  be  the  solution  of  the  initial-value  problem 

±  =  l  +  (y-t)2,  y(0)-±. 
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Use  the  improved  Euler  method  to  compute  approximate  values  ofy(/)  at 
the  points  tk  =  k/N ,  k=  1 

Solution.  The  improved  Euler  scheme  for  this  problem  is 

^+1=^+1  f1 +(>'*- o2+ +(>"*- '*)2)-4+i]2} 

with  h  =  1/N  and  y0  =  0.5.  The  integer  k  runs  from  0  to  N  —  1.  Sample  Pascal 
and  Fortran  programs  to  compute  yv---,yN  are  given  below.  Again,  these 
programs  have  variable  values  for  /0,  y0 ,  a ,  and  N. 

Pascal  Program 

Program  Improved  (input,  output); 
var 

T,  Y:  array [0.. 999]  of  real; 
a,  h,  R:  real; 
k,  N:  integer; 

begin 

readln(T[0],  Y[0],  a,  N); 

h  :=a/N; 

page; 

for  k:=0  to  N  —  1  do 
begin 

R  :=  1  +  (Y[k]  — T[k])  *  (Y[k]  — T[k]); 

T[k  +  1]:=T[k]  +  h; 

Y[k  +  1]:=Y[k]  +  (h/2)*(R  +  1 

+  (Y[k]  +  h*R-T[k  +  1])*(Y[k]  +  h*R-T[k  +  1])); 

end; 

writeln(T:4,  ‘Y’:16); 
for  k  :=0  to  N  do 

writeln  (T[k]:10:7, 1  ’:2,  Y[k]:16:9); 

end. 


Fortran  Program 


Replace  Section  B  of  the  Fortran  program  in  Example  1  of  Section  1.13  by 
the  following: 


20 


T(1)  =  T0  +  H 
R  =  1  +(Y0  — TO)*  *2 

Y(1)  =  Y0  +  (H/2)*(R  +  1  +(Y0  +  (H*R)  — T(1))*  *2) 

D020  K  =  2,  N 

T(K)  =  T(K-1)  +  H 

R  =  1  +  (Y(K  — 1)~T(K— 1))*  *2 

Y(K)  =  Y(K  —  1 ) 4- (H /2) * (R  + 1  +  (Y(K  —  1 )  +  (H * R)  —  T(K)) *  *2) 
CONTINUE 
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See  also  C  Program  6  in  Appendix  C  for  a  sample  C  program. 

Table  1  below  shows  the  results  of  these  computations  for  a  =  1,  N  =  10, 
/0  =  0,  andyo  =  0.5.  If  we  run  this  program  for  N  =  20  and  A  =  40  we  obtain 
that  y20  =  1.99939944  and  y40  =  1.99984675.  Hence  the  values  y10,  y20,  and 
y40  computed  by  the  improved  Euler  method  are  even  closer  to  the  correct 
value  2  than  the  corresponding  values  1.99572313,  1.99884246,  and 
1.99969915  computed  by  the  three  term  Taylor  series  method. 


Table  1 


t 

y 

t 

y 

0 

0.5 

0.6 

1.31377361 

0.1 

0.62628125 

0.7 

1.46848715 

0.2 

0.75547445 

0.8 

1.63225727 

0.3 

0.88809117 

0.9 

1.80752701 

0.4 

1.02477002 

1 

1.99770114 

0.5 

1.16631867 

Exercises 

Using  the  improved  Euler  method  with  A  =  0.1,  determine  an  approximate 
value  of  the  solution  at  t=  1  for  each  of  the  initial-value  problems  1-5.  Re¬ 
peat  these  computations  with  h  =  0.025  and  compare  the  results  with  the 
given  value  of  the  solution. 

1.  dy/dt=\  +  t-y,  y(0)  =  0;  (y(0  =  0 

2.  dy/dt  =  2ty ,  y( 0)  =  2;  ( y(t)  =  2e* *) 

3.  dy/dt=\+y2-t2,  y(0)  =  0;  (y(/)=0 

4.  dy  /  dt  =  te~y +  t/(\  +  t2\  y(0)  =  0;  (y(0“ln(l  + t2)) 

5.  dy/dt= —  \+2t+y2/(\  + 12)2,  y(0)=l;  (y(/)=l  +  /2) 

6.  Using  the  improved  Euler  method  with  /?  =  tt/40,  determine  an  approximate 
value  of  the  solution  of  the  initial-value  problem 

^  =  2sec2f  —  (l+y2),  >’(0)  =  0 

at  t  =  7r/ 4.  Repeat  these  computations  with  A  =  7r  / 1 60  and  compare  the  results 
with  the  number  one  which  is  the  value  of  the  solution y(t)  =  tan/  at  t  =  7r/4. 


1.16  The  Runge-Kutta  method 

We  now  present,  without  proof,  a  very  powerful  scheme  which  was  devel¬ 
oped  around  1900  by  the  mathematicians  Runge  and  Kutta.  Because  of  its 
simplicity  and  great  accuracy,  the  Runge-Kutta  method  is  still  one  of  the 
most  widely  used  numerical  schemes  for  solving  differential  equations.  It  is 
defined  by  the  equation 
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where  yQ=y(t^)  and 


A:  =  0,  1 , . . . ,  /V  —  1 


Lfc,\  “/(A’-VA:)’  Afc,2“/(4"*"  2 +  2 ^k,l) 

Lk,3  =/(4  +  +  \hLk  2),  Lk  A=f{tk  +  A, >>*  +  hLk3). 


This  formula  involves  a  weighted  average  of  values  of  f(t,y)  taken  at  diffe¬ 
rent  points.  Hence  the  sum  \[Lk  x  +  2Lk  2+2Lk3  +  Lk  4]  can  be  interpreted 
as  an  average  slope  It  can  be  shown  that  the  error  \y(t^)—yk\  *s  at  most  a 
fixed  constant  times  h4.  Thus,  the  Runge-Kutta  method  is  much  more  ac¬ 
curate  than  Euler’s  method,  the  three  term  Taylor  series  method  and  the 
improved  Euler  method. 


Example  1.  Let  y(t)  be  the  solution  of  the  initial-value  problem 

f-i+0-02. 

Use  the  Runge-Kutta  method  to  find  approximate  values yX9-..,yN  of  y  at 
the  points  tk  =  k/N ,  k=  1  ,...,iV. 

Solution.  Sample  Pascal  and  Fortran  programs  to  compute  yi,---9yNby  the 
Runge-Kutta  method  are  given  below.  These  programs  differ  from  our 
previous  programs  in  that  they  do  not  computer,  separately.  Rather,  they 
compute y1  in  the  same  “loop”  as  they  compute y2, ... ,yN.  This  is  accom¬ 
plished  by  relabeling  the  numbers  t0  andy0  as  tx  and  yx  respectively. 


Pascal  Program 


Program  Runge_Kutta  (input,  output); 
var 

T,  Y:  array [0. .999]  of  real; 
a,  h,  LK1,  LK2,  LK3,  LK4:  real; 
k,  N:  integer; 

begin 

readln(T[0],  Y[0],  a,  N); 

h  :=a/N; 

page; 

for  k:=0  to  N  — 1  do 
begin 

T[k+1]  :=T[k]  +  h; 

LK1  :=  1  +  (Y[k]  — T[k])  *  (Y[k]  — T[k]); 

LK2  :  =  1  +  ((Y[k]  +  (h/2)  *  LK1)  —  (T[k]  +  h/2)) 
*  «Y[k]  +  (h/2)  *  LK1)  —  (T[k]  +  h/2)); 
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LK3  :=  1  +  ((Y[k]  +  (h/2)  *  LK2)  -  (T[k]  +  h/2)) 

*  ((Y[k]  +  (h/2)  *  LK2)  -  (T[k]  +  h/2)); 

LK4  :=  1  +  ((Y[k]  +  h  *  LK3)  -  (T[k]  +  h)) 

*  ((Y  [k]  +  h  *  LK3)  —  (T  [k]  +  h)) ; 

Y[k  + 1  ]  := Y[k]  +  (h/6)  *(LK1  +  LK4+2*  (LK2  +  LK3)); 
end; 

writeln(‘T’:4,  ‘Y':16); 
for  k  :=0  to  N  do 

writeln(T[k]:10:7,  ‘  ’:2,  Y[k]:16:9); 

end. 


Fortran  Program 


10 


20 


30 


DIMENSION  T(1 000),  Y(1 000) 

READ  (5, 10)  T(1),  Y(1),  A,  N 
FORMAT  (3F20.8, 15) 

H  =  A/N 

D020  K  =  1,N 

T(K  +  1)  =  T(K)  +  H 

REAL  LK1,  LK2,  LK3,  LK4 

LK1  =1  +  (Y(K)  -  T(K))  *  *  2 

LK2  =  1  +  ((Y(K)  +  (H/2)  *  LK1 )  -  (T(K)  +  H/2))  *  *  2 

LK3  =  1  +  ((Y(K)  +  (H /2)  *  LK2)  -  (T(K)  +  H/2))  *  *  2 

LK4  =  1  +  ((Y(K)  +  H  *  LK3)  -  (T(K)  +  H))  *  *  2 

Y(K  +  1 )  =  Y(K)  +  (H /6)  *  (LK1  +  LK4  +  2*  (LK2  +  LK3)) 

CONTINUE 

NA=N+1 

WRITE  (6, 30)  (T(J),  Y(J),  J  =  1 ,  NA) 

FORMAT  (1  HI ,  3X,  1 HT, 4X,  1  HY,/(1  H,  1 X,  FI  0.7, 2X,  F20.9/)) 

CALL  EXIT 

END 


See  also  C  Program  7  in  Appendix  C  for  a  sample  C  program. 

Table  1  below  shows  the  results  of  these  computations  for  a=  1,  N  =  1 0, 
/o=0,  andj>0  =  0.5. 


Table  1 


t 

y 

t 

y 

0 

0.5 

0.6 

1.31428555 

0.1 

0.62631578 

0.7 

1.4692305 

0.2 

0.75555536 

0.8 

1.6333329 

0.3 

0.88823526 

0.9 

1.8090902 

0.4 

1.02499993 

1 

1.9999988 

0.5 

1.16666656 
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Notice  how  much  closer  the  number  yl0  =  1.9999988  computed  by  the 
Runge-Kutta  method  is  to  the  correct  value  2  than  the  numbers  >>10  = 
1.94220484,  yxo  =  1.99572312,  and  j>10  =  1.99770114  computed  by  the  Euler, 
three  term  Taylor  series  and  improved  Euler  methods,  respectively.  If  we 
run  this  program  for  N  =  20  and  TV  =  40,  we  obtain  that  y2o=  1.99999992 
and^40  =  2.  Thus,  our  approximation  ofy(l)  is  already  correct  to  eight  dec¬ 
imal  places  when  h  =  0.025.  Equivalently,  we  need  only  choose  TV  >40  to 
achieve  eight  decimal  places  accuracy. 

To  put  the  accuracy  of  the  various  schemes  into  proper  perspective,  let 
us  say  that  we  have  three  different  schemes  for  numerically  solving  the  ini¬ 
tial-value  problem  dy  / dt=  f(t,y),y( 0)  =  0  on  the  interval  0<  t  <  1,  and  that 
the  error  we  make  in  using  these  schemes  is  3 h,  11  h2,  and  42/i4,  respec¬ 
tively.  If  our  problem  is  such  that  we  require  eight  decimal  places  ac¬ 
curacy,  then  the  step  sizes  hv  h2,  and  h3  of  these  three  schemes  must  satisfy 
the  inequalities  3 hx  <  10“8,  11  h2<  10-8,  and  42h3<  10-8.  Hence,  the  num¬ 
ber  of  iterations  Nl9  N2 ,  and  N3  of  these  three  schemes  must  satisfy  the  in¬ 
equalities 


and 


Nx  >  3  X  108  =  300,000,000,  TV2  >  VY\  X  104« 34,000 

N3>( 42)1/4X  102«260. 


This  is  a  striking  example  of  the  difference  between  the  Runge-Kutta 
method  and  the  Euler,  improved  Euler  and  three  term  Taylor  series 
methods. 


Remark.  It  should  be  noted  that  we  perform  four  functional  evaluations  at 
each  step  in  the  Runge-Kutta  method,  whereas  we  only  perform  one  func¬ 
tional  evaluation  at  each  step  in  Euler’s  method.  Nevertheless,  the 
Runge-Kutta  method  still  beats  the  heck  out  of  Euler’s  method,  the  three 
term  Taylor  series  method,  and  the  improved  Euler  method. 


Exercises 

Using  the  Runge-Kutta  method  with  h  —  0.\9  determine  an  approximate 
value  of  the  solution  at  t=  1  for  each  of  the  initial-value  problems  1-5.  Re¬ 
peat  these  computations  with  h  =  0.025  and  compare  the  results  with  the 
given  value  of  the  solution. 

1.  dy/ dt=\  +  t—y,  >>(0)=0;  (y(0=0 

2.  dy/dt  =  2ty ,  y( 0)  =  2;  (y(/)  =  le*2) 

3«  dy/dt=\+y2  —  t2,  >>(0)  =  0;  (y(/)=0 

4.  dy/dt=te~y  +  t/(\  +  t2),  >>(0)  =  0;  0>(0  =  ln(l  + 12)) 

5.  dy/dt=  -l+2t+y2/((\  +  t2)2),  y(0)=l;  (y(t)=\  +  t2) 
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6.  Using  the  Runge-Kutta  method  with  h  =  tt/40,  determine  an  approximate  value 
of  the  solution  of  the  initial-value  problem 

^  =2sec2/-(l  +y2),  y(  0)  =  0 

at  /  =  7t/4.  Repeat  these  computations  with  /i  =  tt / 1 60  and  compare  the  results 
with  the  number  one  which  is  the  value  of  the  solution y(t)  =  tan/  at  f  =  fl-/4. 


1.17  What  to  do  in  practice 

In  this  section  we  discuss  some  of  the  practical  problems  which  arise  when 
we  attempt  to  solve  differential  equations  on  the  computer.  First,  and  fore¬ 
most,  is  the  problem  of  estimating  the  error  that  we  make.  It  is  not  too  dif¬ 
ficult  to  show  that  the  error  we  make  using  Euler’s  method,  the  three  term 
Taylor  series  method,  the  improved  Euler  method  and  the  Runge-Kutta 
method  with  step  size  h  is  at  most  cxh,  c2h2,  c3h2,  and  c4/*4,  respectively. 
With  one  exception,  though,  it  is  practically  impossible  to  find  the  con¬ 
stants  c{,  c2 ,  c3,  and  c4.  The  one  exception  is  Euler’s  method  where  we  can 
explicitly  estimate  (see  Section  1.13.1)  the  error  we  make  in  approximating 
y(tk)  by  yk.  However,  this  estimate  is  not  very  useful,  since  it  is  only  valid 
for  tk  sufficiently  close  to  t0 ,  and  we  are  usually  interested  in  the  values  of 
y  at  times  t  much  larger  than  t0.  Thus,  we  usually  do  not  know,  a  priori, 
how  small  to  choose  the  step  size  h  so  as  to  achieve  a  desired  accuracy.  We 
only  know  that  the  approximate  values  yk  that  we  compute  get  closer  and 
closer  to  y(tk)  as  h  gets  smaller  and  smaller. 

One  way  of  resolving  this  difficulty  is  as  follows.  Using  one  of  the 
schemes  presented  in  the  previous  section,  we  choose  a  step  size  h  and 

compute  numbers  y j, _ _ yN.  We  then  repeat  the  computations  with  a  step 

size  h/2  and  compare  the  results.  If  the  changes  are  greater  than  we  are 
willing  to  accept,  then  it  is  necessary  to  use  a  smaller  step  size.  We  keep 
repeating  this  process  until  we  achieve  a  desired  accuracy.  For  example, 
suppose  that  we  require  the  solution  of  the  initial-value  problem  y'=f(t,y), 
y(0)=y0  at  t=  1  to  four  decimal  places  accuracy.  We  choose  a  step  size  h 
=  1/100,  say,  and  compute  y\,-^,ym.  We  then  repeat  these  computations 
with  h  =  1/200  and  obtain  new  approximations  z1,...,z200.  If  y100  and  z200 
agree  in  their  first  four  decimal  places  then  we  take  z200  as  our  approxima¬ 
tion  of  y(l).*  If  yl0 o  and  z200  do  not  agree  in  their  first  four  decimal  places, 
then  we  repeat  our  computations  with  step  size  h  =  1  /400. 

Example  1.  Find  the  solution  of  the  initial- value  problem 


*This  does  not  guarantee  that  z200  agrees  with  y(l)  to  four  decimal  places.  As  an  added  pre¬ 
caution,  we  might  halve  the  step  size  again.  If  the  first  four  decimal  places  still  remain  un¬ 
changed,  then  we  can  be  reasonably  certain  that  z20 o  agrees  with  y(l)  to  four  decimal  places. 
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-^=y(  l  +  e-')  +  e‘,  y(0)  =  0 

at  t  =  1  to  four  decimal  places  accuracy. 

Solution.  We  illustrate  how  to  try  and  solve  this  problem  using  Euler’s 
method,  the  three  term  Taylor  series  method,  the  improved  Euler  method, 
and  the  Runge-Kutta  method. 

(i)  Euler's  method : 

Pascal  Program 

Program  Euler  (input,  output); 
var 

T,  Y:  array [0. .999]  of  real; 
a,  h:  real; 
k,  N:  integer; 

begin 

readln(T[0],  Y[0],  a,  N); 

h  := a/N; 

page; 

for  k:=0  to  N  —  1  do 
begin 

T[k+1]  :=T[k]  +  h; 

Y[k  + 1]  := Y[k]  +  h  *  (Y[k]  *  (1  +  exp(- Y[k]))  +  exp(T[k])); 
end; 

writeln(‘N’:4,  ‘h’:10,  'Y[N]':20); 
writeln(N:4, 1  ’:2,h:10:7,‘  ’:2,  Y[N]:18:10); 
end. 

Fortran  Program 


DIMENSION  T(1 000),  Y(1000) 

Section  A 

READ  (5,10)T(1),Y(1),A,  N, 

Read  in  data 

10 

FORMAT  (3F20.8, 15) 

H  =  A/N 

r 

i 

DO  20  K  =  1,N 

Section  B 

T(K  +  1)  =  T(K)  +  H 

Do  computations  - 

i 

Y(K  + 1 )  =  Y(K)  +  H  *  (Y(K)  *  (1  +  EXP(  -  Y(K))) 

1 

+  EXP(T(K») 

.20 

CONTINUE 

WRITE  (6,30)  N,H,Y(N  +  1) 

Section  C 

30 

FORMAT  (1 H,  1 X,  15, 2X,  FI  0.7, 4X,  F20.9) 

Print  out 

CALL  EXIT 

results 

- 

END 
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See  also  C  Program  8  in  Appendix  C  for  a  sample  C  program. 

We  set  A  =  1,  T[0]  =0,  7[0]  =  0  (T(l)=  7(1)  =  0  in  the  Fortran  program)  and 
ran  these  programs  for  N=  10,  20,  40,  80,  160,  320,  and  640.  The  results  of 
these  computations  are  given  in  Table  1.  Notice  that  even  with  a  step  size  h 


Table  1 


N 

h 

ys 

10 

0.1 

2.76183168 

20 

0.05 

2.93832741 

40 

0.025 

3.03202759 

80 

0.0125 

3.08034440 

160 

0.00625 

3.10488352 

320 

0.003125 

3.11725009 

640 

0.0015625 

3.12345786 

as  small  as  1/640,  we  can  only  guarantee  an  accuracy  of  one  decimal 
place.  This  points  out  the  limitation  of  Euler’s  method.  Since  N  is  so  large 
already,  it  is  wiser  to  use  a  more  accurate  scheme  than  to  keep  choosing 
smaller  and  smaller  step  sizes  h  for  Euler’s  method. 

(ii)  The  three  term  Taylor  series  method 


Pascal  Program 

Program  Taylor  (input,  output); 
var 

T,  Y:  array [0.. 999]  of  real; 
a,  h,  DY1,  DY2:  real; 
k,  N:  integer; 

begin 

readln(T[0],  Y[0],  a,  N); 

h  := a/N; 

page; 

for  k:=0  to  N  —  1  do 
begin 

T[k  +  1]:=T[k]  +  h; 

DY1  :=  1  +  (1  —  Y[k])  *  exp(  — Y[k]); 

DY2  :=Y[k]  *  (1  +  exp(  — Y[k]))  +  exp(T[k]); 

Y[k+ 1]  :=Y[k]  +  h  *  DY2  +  (h  *  h/2)  *  (exp(T[k])  +  DY1  *  DY2); 
end; 

writeln(‘N’:4,  ‘h’:10,  ‘Y[N]’:20); 
writeln(N:4,  ‘  ’:2,h:10:7,‘  :2,  Y[N]:18:10); 
end. 
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Fortran  Program 


Replace  Section  B  of  the  previous  Fortran  program  by 


20 


D020  K  =  1 ,  N 

T(K  +  1)  =  T(K)  +  H 

DY1  =  1  +  (1  -  Y(K))  *  EXP(  -  Y(K)) 

DY2  =  Y(K)  *  (1  +  EXP(  -  Y(K)))  +  EXP(T(K)) 

Y(K  + 1 )  =  Y(K)  +  H  *  DY2  +  (H  *  H/2)  *  (EXP(T(K))  +  DY1  *  DY2) 
CONTINUE 


See  also  C  Program  9  in  Appendix  C  for  a  sample  C  program. 

We  set  A  =  l,  T[0]=0,  and  7[0]  =  0  (T(1)  =  0,  and  7(1)  =  0  in  the  Fortran 
program)  and  ran  these  programs  for  N=  10,  20,  40,  60,  80,  160,  and  320. 
The  results  of  these  computations  are  given  in  Table  2.  Observe  that  yl60 


Table  2 


N 

h 

yN 

10 

0.1 

3.11727674 

20 

0.05 

3.12645293 

40 

0.025 

3.12885845 

80 

0.0125 

3.12947408 

160 

0.00625 

3.12962979 

320 

0.003125 

3.12966689 

and  y320  agree  in  their  first  four  decimal  places.  Hence  the  approximation 
y(l)  =  3. 12966689  is  correct  to  four  decimal  places. 

(iii)  The  improved  Euler  method 

Pascal  Program 

Program  Improved  (input,  output); 
var 

T,  Y:  array [0.. 999]  of  real; 
a,  h,  R1,  R2:  real; 
k,  N:  integer; 

begin 

readln(T[0],  Y[0],  a,  N); 

h  :=a/N; 

page; 

for  k:=0  to  N  — 1  do 
begin 

T[k+1]:=T[k]  +  h; 
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R1  :=Y[k]*(1  +exp(-Y[k]))  +  exp(T[k]); 

R2  := (Y[k]  +  h  *  R1)  *  (1  +  exp(  —  (Y[k]  +  h  *  R1 )))  +  exp(T(k  + 1]); 
Y[k  + 1]  := Y[k]  +  (h/2)  *  (R1  +  R2); 
end; 

writeln(‘N’:4,  ‘h’:10,  ‘Y[N]’:20); 
writeln(N:4,  ‘  ’:2,h:10:7,‘  ’:2,  Y[N]:18:10); 
end. 


Fortran  Program 


Replace  Section  B  of  the  first  Fortran  program  in  this  section  by 


20 


DO  20  K  =  1,N 
T(K  +  1)  =  T(K)  +  H 

R1  =  Y(K)  *  (1  +  EXP(  -  Y(K)))  +  EXP(T(K)) 

R2  =  (Y(K)  +  H  *  R1 )  *  (1  +  EXP(  -  (Y(K)  +  H  *  R1 )))  +  EXP(T(K  + 1 )) 
Y(K  + 1 )  =  Y(K)  +  (H/2)  *  (R1  +  R2) 

CONTINUE 


See  also  C  Program  10  in  Appendix  C  for  a  sample  C  program. 

We  set  A  =  1,  T[ 0]  =  0  and  Y[0] = 0  (T(l) = 0  and  Y(l) = 0  in  the  Fortran  pro¬ 
gram)  and  ran  these  programs  for  N  =  10,  20,  40,  80,  160,  and  320.  The  re¬ 
sults  of  these  computations  are  given  in  Table  3.  Observe  that  y150  and  _y320 


Table  3 


N 

h 

yN 

10 

0.1 

3.11450908 

20 

0.05 

3.12560685 

40 

0.025 

3.1286243 

80 

0.0125 

3.12941247 

160 

0.00625 

3.12961399 

320 

0.003125 

3.12964943 

agree  in  their  first  four  decimal  places.  Hence  the  approximation  y(l)  = 
3.12964943  is  correct  to  four  decimal  places. 

(iv)  The  method  of  Runge-Kutta 


Pascal  Program 


Program  Runge-Kutta  (input,  output); 
var 

T,  Y:  array [0.. 999]  of  real; 
a,  h,  LK1,  LK2,  LK3,  LK4:  real; 
k,  N:  integer; 
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begin 

readln(T[0],  Y[0],a,  N); 

h  :=a/N; 

page; 

for  k:=0  to  N  —  1  do 
begin 

T[k+1]  :=T[k]  +  h; 

LK1  :=  (Y[k]  *  (1  +exp(- Y[k]))  +  exp(T[k]); 

LK2  :=  (Y[k]  +  (h/2)  *  LK1)  *  (1  +  exp(  -  (Y[k]  +  (h/2)  *  LK1))) 
+  exp(T[k]  +  (h/2)); 

LK3  :=  (Y[k]  +  (h/2)  *  LK2)  *  (1  +  exp(  -  (Y[k]  +  (h/2)  *  LK2))) 
+  exp(T[k]  +  (h/2)); 

LK4  :=  (Y[k]  +  h  *  LK3)  *  (1+  exp(  -  (Y[k]  +  h  *  LK3))) 

+  exp(T[k+1]); 

Y[k  + 1  ]  := Y[k]  +  (h/6)  *  (LK1  +  2*LK2  +  2*LK3  +  LK4); 
end; 

writeln(‘N’:4,  ‘h’:10,  ‘Y[N]’:20); 
writeln(N:4,  ‘  :2,  h:10:7,  ‘  :2,  Y[N]:18:10); 

0  0  1  10 


Fortran  Program 


Replace  Section  B  of  the  first  Fortran  program  in  this  section  by 


1 

1 

1 


DO  20  K  =  1 ,  N 
T(K  +  1)  =  T(K)  +  H 

LK1  =  Y(K)  *  (1  +  EXP(  -  Y(K))  +  EXP(T(K)) 

LK2  =  (Y(K)  +  (H /2)  *  LK1 )  *  (1  +  EXP(  -  (Y(K)  +  (H /2)  *  LK1 ))) 
+  EXP(T(K)  +  (H /2)) 

LK3  =  (Y(K)  +  (H/2)  *  LK2)  *  (1  +  EXP(  -  (Y(K)  +  (H /2)  *  LK2))) 
+  EXP(T(K)  +  (H /2)) 

LK4  =  (Y(K)  +  H  *  LK3)  *  (1  +  EXP(  -  (Y(K)  +  H  *  LK3))) 

+  EXP(T(K  + 1 )) 

Y(K  + 1 )  =  Y(K)  +  (H/6)  *  (LK1  +  2  *  LK2  +  2  *  LK3  +  LK4) 
CONTINUE 


See  also  C  Program  1 1  in  Appendix  C  for  a  sample  C  program. 

We  set  A  =  l,  T[0]  =  0and  7[0]=0(T(l)  =  0and  7(1)  =  0  in  the  Fortran  pro¬ 
gram)  and  ran  these  programs  for  N=  10,  20,  40,  80,  160,  and  320.  The  re¬ 
sults  of  these  computations  are  given  in  Table  4.  Notice  that  our  approxi¬ 
mation  of  y{\)  is  already  correct  to  four  decimal  places  with  h  =  0.1,  and 
that  it  is  already  correct  to  eight  decimal  places  with  A  =  0.00625(A=  160). 
This  example  again  illustrates  the  power  of  the  Runge-Kutta  method. 

We  conclude  this  section  with  two  examples  which  point  out  some  addi¬ 
tional  difficulties  which  may  arise  when  we  solve  initial-value  problems  on 
a  digital  computer. 
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Table  4 


N 

h 

ys 

10 

0.1 

3.1296517 

20 

0.05 

3.12967998 

40 

0.025 

3.1296819 

80 

0.0125 

3.12968203 

160 

0.00625 

3.12968204 

320 

0.003125 

3.12968204 

Example  2.  Use  the  Runge-Kutta  method  to  find  approximate  values  of 
the  solution  of  the  initial-value  problem 

^■  =  ?2+72,  ^(0)=1 

at  the  points  tk  =  k/ N,  k= 

Solution. 


Pascal  Program 

Program  Runge_Kutta  (input,  output); 
var 

T,  Y:  array [0.. 999]  of  real; 
a,  h,  LK1,  LK2,  LK3,  LK4:  real; 
k,  N:  integer; 

begin 

readln(T[0],Y[0],a,  N); 

h  :=a/N; 

page; 

for  k:=0  to  N  — 1  do 
begin 

T[k  +  1]:=T[k]  +  h; 

LK1  :=T[k]  *  T[k]  +  Y[k]  *  Y[k]; 

LK2  :=  (T[k]  +  h/2)  *  (T[k]  +  h/2) 

+  (Y[k]  +  (h/2)  *  LK1)  *  (Y[k]  +  (h/2)  *  LK1); 

LK3  :=  (T[k]  +  h/2)  *  (T[k]  +  h/2) 

+  (Y[k]  +  (h/2)  *  LK2)  *  (Y[k]  +  (h/2)  *  LK2); 
LK4:=(T[k]  +  h)*(T[k]  +  h) 

+  (Y[k]  +  h  *  LK3)  *  (Y[k]  +  h  *  LK3); 

Y[k  + 1]  :=Y[k]  +  (h/6)  * (LK1 +  2  *  LK2  +  2  *  LK3  +  LK4); 
end; 

writeln(T:4,  ‘Y’:15); 
for  k:=0  to  N  do 
writeln(T[k]:10:7,  ‘  ’:2,  Y[k]:16:9); 

end. 
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Fortran  Program 


Replace  Sections  B  and  C  of  the  first  Fortran  program  in  this  section  by 


r 


Section  B 
Do  computa¬ 
tions 


DO  20  K  =  1 ,  N 

T(K  +  1 )  =  T(K)  +  H 

LK1  =T(K)*  *2  + Y(K)*  *2 

LK2  =  (T(K)  +  (H/2))*  *2  +  (Y(K)  +  (H/2)*LK1)*  *2 

LK3  =  (T(K)  +  (H/2))**2  +  (Y(K)  +  (H/2) * LK2) *  *2 

LK4  =  (T(K)  +  H) *  * 2  +  (Y(K)  +  H * LK3) *  *2 

Y(K  + 1 )  =  Y(K)  +  (H/6)  *  (LK1  +  2*  LK2  +  2  *  LK3  +  LK4) 

CONTINUE 


r 


Section  C 
Print  out 
results 


30 


NA=N+1 

WRITE  (6, 30)  (T(J),  Y(J),  J  =  1 ,  NA) 

FORMAT  (1  HI ,  3X,  1 HT,  4X,  1 HY /(I  H,  1 X,  F9.7, 
2X.F20.9/)) 

CALL  EXIT 
END 


See  also  C  Program  12  in  Appendix  C  for  a  sample  C  program. 

We  attempted  to  run  these  programs  with  A  =  1,  T[0]  =  0,  Y[0]  =  1  (T(l)  =  0, 
and  7(1)=  1  in  the  Fortran  program)  and  N=  10,  but  we  received  an  error 
message  that  the  numbers  being  computed  exceeded  the  domain  of  the 
computer.  That  is  to  say,  they  were  larger  than  1038.  This  indicates  that  the 
solution  y(t)  goes  to  infinity  somewhere  in  the  interval  [0, 1].  We  can  prove 
this  analytically,  and  even  obtain  an  estimate  of  where  y(t)  goes  to  infinity, 
by  the  following  clever  argument.  Observe  that  for  0<  t  <  1,  y(t)  is  never 
less  than  the  solution  <J>1(0=  1  /(I  ~  0  of  the  initial-value  problem 

Tt=y2’  ^(0)=1- 

In  addition,  y(t)  never  exceeds  the  solution  <#>2(0  =  tan(/  + 77-/4)  of  the  ini¬ 
tial-value  problem  dy /dt=  1  +y2,  y(0)  =  1.  Hence,  for  0  <  t  <  1, 

■:-j_  <  y(t)  <  tan(/  +  77-/4). 

This  situation  is  described  graphically  in  Figure  1.  Since  and  <j>2(0  be¬ 
come  infinite  at  t=l  and  /  =  7t/4  respectively,  we  conclude  thaty(/)  be¬ 
comes  infinite  somewhere  between  7r /4  and  1. 

The  solutions  of  most  initial-value  problems  which  arise  in  physical  and 
biological  applications  exist  for  all  future  time.  Thus,  we  need  not  be 
overly  concerned  with  the  problem  of  solutions  going  to  infinity  in  finite 
time  or  the  problem  of  solutions  becoming  exceedingly  large.  On  the  other 
hand,  though,  there  are  several  instances  in  economics  where  this  problem 
is  of  paramount  importance.  In  these  instances,  we  are  often  interested  in 
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determining  whether  certain  differential  equations  can  accurately  model  a 
given  economic  phenomenon.  It  is  often  possible  to  eliminate  several  of 
these  equations  by  showing  that  they  allow  solutions  which  are  unrealisti¬ 
cally  large. 


Example  3.  Use  Euler’s  method  to  determine  approximate  values  of  the 
solution  of  the  initial-value  problem 

^•=>I|>,r3/4+/siny,  y(0)  =  0  (1) 


at  the  points  1/ N,2/ N,...,2. 

Solution.  The  programming  for  this  problem  is  simplified  immensely  by 
observing  that 


y\y\  3y/4  =  (sgn>?)|>?|1/49  where  sgny 


1,  y>0 
<  0,  y  =  0 

-l,  y<0 

V. 


Pascal  Program 

Program  Euler  (input,  output); 
const 

PI  =  3.141592654; 
var 

T,  Y:  array [0.  .999]  of  real; 
h:  real; 
k,  N:  integer; 
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begin 
readln(N); 
page; 
h  :=2/N; 

T[1]  :=h; 

Y[1]:=0; 

for  k:=1  to  N  — 1  do 
begin 

T[k  +  1]:=T[k]  +  h; 

if  Y[k]  =  0  then  Y[k  +  1]  :=h  *T[k]  *sin(PI/T[k])  else 
Y[k  +  1]  :=Y[k]  +  h  *(Y[k]  *exp((  — 3/4)*  ln(abs(Y[k]))) 

+T[k]  *sin(PI/T[k])); 

end; 

writeln(‘T’:4,  ‘Y’:15); 
for  k  :=  1  to  N  do 
writeln(T[k]:10:7,  ‘  ’:2,  Y[k]:16:9); 

end. 


Fortran  Program 


10 


20 

30 


DIMENSION  T(1 000),  Y(1 000) 

READ  (5, 10)  N 
FORMAT  (15) 

H  =  2/N 
T(1)=H 
Y(1)  =  0 
DO  20  K  =  2,  N 
T(K)  =  T(K-1)  +  H 

Y(K)  =  Y(K  —  1 )  +  H  *  (SIGN  (Y(K  —  1 ))  *  ABS(Y  (K  —  1 ))  *  *  0.25 
+  T(K  — 1)*SIN(3.141592654/T(K  — 1))) 

CONTINUE 

WRITE(6, 30)  0, 0,  (T(J),  Y(J),  J  =  1 ,  N) 

FORMAT  (1  HI ,  3X,  1 HT,  4X,  1  HY/(1  H,  1 X,  FI  0.7, 2X,  F20.9 /)) 

CALL  EXIT 

END 


See  also  C  Program  13  in  Appendix  C  for  a  sample  C  program. 

When  we  set  A  =  25  we  obtained  the  value  2.4844172  for  y  (2),  but  when  we 
set  A  =  27,  we  obtained  the  value  —0.50244575  for  y (2).  Moreover,  all  the 
yk  were  positive  for  A  =  25  and  negative  for  A  =  27.  We  repeated  these 
computations  with  A  =  89  and  A=91  and  obtained  the  values  2.64286349 
and  —0.6318074  respectively.  In  addition,  all  they*  were  again  positive  for 
A  =  89  and  negative  for  A  =  91.  Indeed,  it  is  possible,  but  rather  difficult, 
to  prove  that  all  they*  will  be  positive  if  A  =  1,5,9, 13, 17,...  and  negative 
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if  TV  =  3, 7,  11, 15, ...  .  This  suggests  that  the  solution  of  the  initial-value 
problem  (1)  is  not  unique.  We  cannot  prove  this  analytically,  since  we 
cannot  solve  the  differential  equation  explicitly.  It  should  be  noted  though, 
that  the  existence -uniqueness  theorem  of  Section  1.10  does  not  apply  here, 
since  the  partial  derivative  with  respect  toy  of  the  function  |y|  ~3'*y  + 
/sin m / 1  does  not  exist  aty  =  0. 

Most  of  the  initial-value  problems  that  arise  in  applications  have  unique 
solutions.  Thus,  we  need  not  be  overly  concerned  with  the  problem  of  non¬ 
uniqueness  of  solutions.  However,  we  should  always  bear  in  mind  that  ini¬ 
tial-value  problems  which  do  not  obey  the  hypotheses  of  the  existence- 
uniqueness  theorem  of  Section  1.10  might  possess  more  than  one  solution, 
for  the  consequences  of  picking  the  wrong  solution  in  these  rare  instances 
can  often  be  catastrophic. 


Exercises 


In  each  of  Problems  1-5,  find  the  solution  of  the  given  initial-value  prob¬ 
lem  at  t  —  1  to  four  decimal  places  accuracy. 

1.  *L-y  +  e-r  +  2t,  >>(0)  =  0 

3  ^  = 
dt 


t2+y2 
1  +  t+y: 


,  y(0)  =  0 


dy 


5.  =  ty2-y ,  ^(0)=1 


2.  J  =  1  -t+y2,  ^(0)  =  0 
4.  ^  =  e‘y2-2y,  ^(0)=1 
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2.1  Algebraic  properties  of  solutions 

A  second-order  differential  equation  is  an  equation  of  the  form 


d2y 

1? 


(i) 


For  example,  the  equation 


d2y  .  ,,,  ,(dy\2 

dt2  y  \dt) 

is  a  second-order  differential  equation.  A  function  y=y(t)  is  a  solution  of 
(1)  ify(0  satisfies  the  differential  equation;  that  is 

d2y(f)  fl,  (A  M‘)\ 


Thus,  the  function  y(t)  =  cos  t  is  a  solution  of  the  second-order  equation 
d2y / dt2  —  —y  since  d\cost) / dt2  =  —  cos/. 

Second-order  differential  equations  arise  quite  often  in  applications. 
The  most  famous  second-order  differential  equation  is  Newton’s  second 
law  of  motion  (see  Section  1.7) 


which  governs  the  motion  of  a  particle  of  mass  m  moving  under  the  in¬ 
fluence  of  a  force  F.  In  this  equation,  m  is  the  mass  of  the  particle,  y  =y(t) 
is  its  position  at  time  /,  dy/dt  is  its  velocity,  and  F  is  the  total  force  acting 
on  the  particle.  As  the  notation  suggests,  the  force  F  may  depend  on  the 
position  and  velocity  of  the  particle,  as  well  as  on  time. 
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In  addition  to  the  differential  equation  (1),  we  will  often  impose  initial 
conditions  on  y{t)  of  the  form 

y(to)=y&  y'{to)=y'o •  O') 

The  differential  equation  (1)  together  with  the  initial  conditions  (T)  is  re¬ 
ferred  to  as  an  initial-value  problem.  For  example,  let  y(t)*  denote  the 
position  at  time  t  of  a  particle  moving  under  the  influence  of  gravity.  Then, 
y{t)  satisfies  the  initial-value  problem 

d2y 

-rj  =  -g;  y{t0)=yo>  /OoWo. 


where^  is  the  initial  position  of  the  particle  and^o  is  the  initial  velocity  of 
the  particle. 

Second-order  differential  equations  are  extremely  difficult  to  solve.  This 
should  not  come  as  a  great  surprise  to  us  after  our  experience  with  first- 
order  equations.  We  will  only  succeed  in  solving  the  special  differential 
equation 

^+p(t)^  +  <i(t)y=g(t)-  (2) 


Fortunately,  though,  many  of  the  second-order  equations  that  arise  in  ap¬ 
plications  are  of  this  form. 

The  differential  equation  (2)  is  called  a  second-order  linear  differential 
equation.  We  single  out  this  equation  and  call  it  linear  because  both>>  and 
dy/dt  appear  by  themselves.  For  example,  the  differential  equations 


and 


d2y  ,dy 

dt2  dt  * 


are  linear,  while  the  differential  equations 


and 


d2y  „  dy  , 

— 7  +  3  -j-  +  sin y  =  t 3 
dt 2  dt  y 


^+(%Y- 1 

dt2  \dt ) 


are  both  nonlinear,  due  to  the  presence  of  the  siny  and  {dy/dt)2  terms,  re¬ 
spectively. 

We  consider  first  the  second-order  linear  homogeneous  equation 


(3) 


•The  positive  direction  of  y  is  taken  upwards. 
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which  is  obtained  from  (2)  by  setting  g(/)  =  0.  It  is  certainly  not  obvious  at 
this  point  how  to  find  all  the  solutions  of  (3),  or  how  to  solve  the  initial- 
value  problem 

^+p{t)^  +  q(t)y=0-,  y{t0)=yo,  y'(t0)=y'0.  (4) 

Therefore,  before  trying  to  develop  any  elaborate  procedures  for  solving 
(4),  we  should  first  determine  whether  it  actually  has  a  solution.  This  infor¬ 
mation  is  contained  in  the  following  theorem,  whose  proof  will  be  indi¬ 
cated  in  Chapter  4. 

Theorem  1.  (Existence-uniqueness  Theorem).  Let  the  functions  p(t)  and 
q(t )  be  continuous  in  the  open  interval  a<t<fi.  Then,  there  exists  one, 
and  only  one  function  y{t)  satisfying  the  differential  equation  (3)  on  the  en¬ 
tire  interval  a<  t<  ft,  and  the  prescribed  initial  conditions  y(t0)=y0,  y'(t0) 
=y q.  In  particular,  any  solution  y  =  y  (t)  of  (3)  which  satisfies  y(t0)  =  0  and 
y\t 0)  =  0  at  some  time  t  =  t0  must  be  identically  zero. 

Theorem  1  is  an  extremely  important  theorem  for  us.  On  the  one  hand, 
it  is  our  hunting  license  to  find  the  unique  solution  y(t)  of  (4).  And,  on  the 
other  hand,  we  will  actually  use  Theorem  1  to  help  us  find  all  the  solutions 
of  (3). 

We  begin  our  analysis  of  Equation  (3)  with  the  important  observation 
that  the  left-hand  side 

y"+p(t)y'+q(t)y  (y  =li,y' =  ^) 

of  the  differential  equation  can  be  viewed  as  defining  a  “function  of  a 
function”:  with  each  function  y  having  two  derivatives,  we  associate 
another  function,  which  we’ll  call  L[y],  by  the  relation 

L[y]{t)=y”{t)+p(t)y’{t)  +  q{t)y{t). 

In  mathematical  terminology,  L  is  an  operator  which  operates  on  func¬ 
tions;  that  is,  there  is  a  prescribed  recipe  for  associating  with  each  function 
y  a  new  function  L[y]. 

Example  1.  Let  p(t)  =  0  and  q(t)=  t.  Then, 

L[y](t)=y"(t)  +  ty(t). 

If  y(/)  =  cosL  then 

L[y](t)  =  (cost)"  +  tcost  =  (t—  l)cos  /, 
and  if  y(t)=  t3,  then 

L[y](t)  =  (ty  +  t(t*)  =  t*  +  6t. 
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Thus,  the  operator  L  assigns  the  function  (t—  l)cos/  to  the  function  cos/, 
and  the  function  6/  + 14  to  the  function  t3. 

The  concept  of  an  operator  acting  on  functions,  or  a  “function  of  a 
function”  is  analogous  to  that  of  a  function  of  a  single  variable  t.  Recall 
the  definition  of  a  function  /  on  an  interval  /:  with  each  number  /  in  I  we 
associate  a  new  number  called  /(/).  In  an  exactly  analogous  manner,  we 
associate  with  each  function^  having  two  derivatives  a  new  function  called 
L[y).  This  is  an  extremely  sophisticated  mathematical  concept,  because  in 
a  certain  sense,  we  are  treating  a  function  exactly  as  we  do  a  point. 
Admittedly,  this  is  quite  difficult  to  grasp.  It’s  not  surprising,  therefore, 
that  the  concept  of  a  “function  of  a  function”  was  not  developed  till  the 
beginning  of  this  century,  and  that  many  of  the  “high  powered”  theorems 
of  mathematical  analysis  were  proved  only  after  this  concept  was 
mastered. 

We  now  derive  several  important  properties  of  the  operator  L,  which  we 
will  use  to  great  advantage  shortly. 

Property  1.  L[cy]  =  cL[y\  for  any  constant  c. 

Proof.  L[cy]{t)  =  (cy)"(t)  +p(t)(cyY(t)  +  q(t)(cy)(t) 

=  cy"{t)+cp{t)y'{t)  +  cq(t)y(t) 

=  c[y"(t)+  P(t)y\t)  +  q{t)y(t)] 

=  cL[y}{t).  □ 

The  meaning  of  Property  1  is  that  the  operator  L  assigns  to  the  function 
(cy)  c  times  the  function  it  assigns  to  y.  For  example,  let 

L[y]{t)=y"{t)  +  (>y'(t)-2y{t). 

This  operator  L  assigns  the  function 

(t2)"  +  6(t2)'-2(t2)  =  2+l2t-2t2 

to  the  function  t2.  Hence,  L  must  assign  the  function  5(2+  12t-2/2)  to  the 
function  5 12. 

Property  2.  L[yx  +y2]  =  L[yx]+  L[y2]. 

Proof. 

L[yi+y2](t)  =  (y\+y2)"(t)+p(t)(yi+y2)'(t)  +  <](‘)(yi+y2)(t) 

=y\(.t)+y'i{t)+p{t)y\{t)+p{t)y'1{t)  +  q(t)yx{t)  +  q{t)y2{t) 

=  [y\{t)+p(t)y'\{t)  +  q{t)y\{t)]  +  [y2(t)+p{t)y'2{t)  +  q{t)y2{t)] 

=  L[y{](t)  +  L[y2](t).  □ 

The  meaning  of  Property  2  is  that  the  operator  L  assigns  to  the  function 
y,+j>2  fhe  sum  of  the  functions  it  assigns  toj>j  and>>2.  For  example,  let 

L[y](t)=y"(t)  +  2y'(t)-y(t). 
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This  operator  L  assigns  the  function 

(cos  t ) "  +  2(cos  /)'  —  cos  /  =  —  2  cos  t  —  2  sin  t 
to  the  function  cos  t,  and  the  function 

(sin  t) "  4-  2(sin  t)'  —  sin  t  =  2  cos  t  —  2  sin  t 
to  the  function  sin  t.  Hence,  L  assigns  the  function 

( —  2  cos  /  —  2  sin  /)  +  2  cos  t  —  2  sin  /  =  —  4  sin  / 
to  the  function  sin/  +  cosf. 


Definition.  An  operator  L  which  assigns  functions  to  functions  and  which 
satisfies  Properties  1  and  2  is  called  a  linear  operator.  All  other  opera¬ 
tors  are  nonlinear.  An  example  of  a  nonlinear  operator  is 


L[y]{t)=y"{t)-2t[y(t)]A. 
This  operator  assigns  the  function 


to  the  function  1  / /,  and  the  function 


2c(l  —  c3) 


to  the  function  c / 1.  Hence,  for  c¥=0, 1,  and >>(/)=  1  //,  we  see  that  L[cy] 
=£cL[y], 


The  usefulness  of  Properties  1  and  2  lies  in  the  observation  that  the 
solutions  y{t)  of  the  differential  equation  (3)  are  exactly  those  functions  y 
for  which 

L[y](t)=y"{t)+p(t)y’{t)  +  q(t)y(t)  =  0. 

In  other  words,  the  solutions  y(t)  of  (3)  are  exactly  those  functions  y  to 
which  the  operator  L  assigns  the  zero  function*  Hence,  iiy(t)  is  a  solution 
of  (3)  then  so  is  cy(t)9  since 

L[cy](f)  =  cL[>>](/)  =  0. 

If  yx(t)  andy2(0  are  solutions  of  (3),  then y\(t)+y2(t)  is  also  a  solution  of 
(3),  since 

^[>;l+>;2](0  =  ^[>;l](0  +  ^[>;2](0  =  0  +  0=::0- 

Combining  Properties  1  and  2,  we  see  that  all  linear  combinations 

Ciyx{t)  +  c2y2{t) 

of  solutions  of  (3)  are  again  solutions  of  (3). 

*The  zero  function  is  the  function  whose  value  at  any  time  t  is  zero. 
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The  preceding  argument  shows  that  we  can  use  our  knowledge  of  two 
solutions  yx( t)  and  y2(t)  of  (3)  to  generate  infinitely  many  other  solutions. 
This  statement  has  some  very  interesting  implications.  Consider,  for  exam¬ 
ple,  the  differential  equation 


+y=  0. 


Two  solutions  of  (5)  are ^(7)  =  cos/  andy2(/)  =  sin/.  Hence, 

y  ( t)  =  Cj  cos  t  +  c2  sin  t  (6) 

is  also  a  solution  of  (5),  for  every  choice  of  constants  cx  and  c2.  Now, 
Equation  (6)  contains  two  arbitrary  constants.  It  is  natural  to  suspect, 
therefore,  that  this  expression  represents  the  general  solution  of  (5);  that  is, 
every  solution  y(t)  of  (5)  must  be  of  the  form  (6).  This  is  indeed  the  case, 
as  we  now  show.  Let  y(t)  be  any  solution  of  (5).  By  the  existence-unique¬ 
ness  theorem,  y(t)  exists  for  all  t.  Let  j>(0)=>>0,  y(0)=j>o,  and  consider  the 
function 

<#>(. t)  =  y0  cos  t  +  y'0  sin  t. 

This  function  is  a  solution  of  (5)  since  it  is  a  linear  combination  of  solu¬ 
tions  of  (5).  Moreover,  <HO)=y0  and  <t>'(0)=yQ.  Thus,  y(t)  and  <j>(t)  satisfy 
the  same  second-order  linear  homogeneous  equation  and  the  same  initial 
conditions.  Therefore,  by  the  uniqueness  part  of  Theorem  l,y(t)  must  be 
identically  equal  to  </>(t),  so  that 

y(0~yo  cos  t -h  y  q  sin  t. 

Thus,  Equation  (6)  is  indeed  the  general  solution  of  (5). 

Let  us  return  now  to  the  general  linear  equation  (3).  Suppose,  in  some 
manner,  that  we  manage  to  find  two  solutions  yx(t)  and  y2(t)  of  (3).  Then, 
every  function 

y{t)=ciy\{t)+c2y2{t)  (7) 

is  again  a  solution  of  (3).  Does  the  expression  (7)  represent  the  general 
solution  of  (3)?  That  is  to  say,  does  every  solution  y(t)  of  (3)  have  the  form 
(7)?  The  following  theorem  answers  this  question. 


Theorem  2.  Let yx(t)  and y2(t)  be  two  solutions  of  (3)  on  the  interval  a<t< 
/?,  with 

y\ (0^2(0 -y\ (0^(0 

unequal  to  zero  in  this  interval.  Then , 

y{t)=c\yi{t)+c2y2{t) 
is  the  general  solution  of  (3). 

Proof.  Let  y(t)  be  any  solution  of  (3).  We  must  find  constants  c,  and  c2 
such  that y(t)^ clyl(t)+  c2y2(t).  To  this  end,  pick  a  time  t0  in  the  interval 
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(a,  13)  and  let and  denote  the  values  of  y  and>>'  at  t  =  tQ.  The  constants 
cx  and  c2,  if  they  exist,  must  satisfy  the  two  equations 

c\y\{to)+ciy2{to)=yo 

C\y\{.t0)  +  c2y'1{t(i)=y'0. 

Multiplying  the  first  equation  by  >>2(/0),  the  second  equation  by  y2(t 0)  and 
subtracting  gives 


ci[yi(h)y'2(t0)-y\(  t0)  y2  ( t0)  ]  =y0y2  ( tQ)  -y'0y2(  '<>)• 

Similarly,  multiplying  the  first  equation  by  y\(t0 ),  the  second  equation  by 
yx(t0)  and  subtracting  gives 

ci[  y'i  OoMto) -yiMyiM  ]  =yoy\ (*0)  -/o^i  (*0)- 


Hence, 


and 


yMQ-y'oyA^) 

yA^y'ii^-yii^yiOo) 

y'oy\{k)-yoy'\{to) 
yi(to)y2(to)  ~y\ ( to)y2(to ) 


y\{t^y2{t^-y\{t^y2{t^0.  Now,  let 

</,(0  =  ^l>'l(0  +  C2>'2(0 

for  this  choice  of  constants  cvc2.  We  know  that  <f>(t)  satisfies  (3),  since  it  is 
a  linear  combination  of  solutions  of  (3).  Moreover,  by  construction,  <t>(t0)  = 
y0  and  ^'(to)s=y,0-  Thus,  y(t)  and  </>(7)  satisfy  the  same  second-order  linear 
homogeneous  equation  and  the  same  initial  conditions.  Therefore,  by  the 
uniqueness  part  of  Theorem  \,y(t)  must  be  identically  equal  to  that 
is, 

y(t)  =  clyl(t)  +  c2y2(t),  a<t<p.  □ 

Theorem  2  is  an  extremely  useful  theorem  since  it  reduces  the  problem 
of  finding  all  solutions  of  (3),  of  which  there  are  infinitely  many,  to  the 
much  simpler  problem  of  finding  just  two  solutions  yx(t\y2(t).  The  only 
condition  imposed  on  the  solutions  yft)  and  y2(t )  is  that  the  quantity 
y\(t)yi(t)~y\(t)y2{t)  he  unequal  to  zero  for  a<t</3.  When  this  is  the 
case,  we  say  thaty^t)  andj>2(0  are  a  fundamental  set  of  solutions  of  (3), 
since  all  other  solutions  of  (3)  can  be  obtained  by  taking  linear  combina¬ 
tions  oiyft)  andj>2(7). 


Definition.  The  quantity  yx{t)y2{t)-y\{t)y2{t)  is  called  the  Wronskian  of 
yx  andj>2,  and  is  denoted  by  W(t)  =  W[yX9yf$t). 

Theorem  2  requires  that  W[yXiyf\{t)  be  unequal  to  zero  at  all  points  in 
the  interval  (a,  /?).  In  actual  fact,  the  Wronskian  of  any  two  solutions 
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y ft),y2{t)  of  (3)  is  either  identically  zero,  or  is  never  zero,  as  we  now 
show. 

Theorem  3.  Let  p(t)  and  q(t)  be  continuous  in  the  interval  a<t<j 8,  and  let 
yft)  and  y2{t)  be  two  solutions  of  (3).  Then ,  W[yx,yf\(t)  is  either  identi¬ 
cally  zero ,  or  is  never  zero ,  on  the  interval  a<t<({. 

We  prove  Theorem  3  with  the  aid  of  the  following  lemma. 

Lemma  1.  Let  y  ft)  and  y2(t)  be  two  solutions  of  the  linear  differential  equa¬ 
tion  y"  +  p(t)yf  4-  q(t)y  =  0.  Then,  their  Wronskian 

W^-W^y^-y^y^-y^y^t) 

satisfies  the  first-order  differential  equation 

W'+p(t)W=  0. 


Proof.  Observe  that 

~y  1^2  +y\y\-y\y'2-y\y2 

=y\y'2-y'[y2- 

Since  yx  and  y2  are  solutions  of  y"  +  p{t)y'  +  q{t)y  =  0,  we  know  that 

y'i  =  -p{t)y,2-ci{t)y2 

and 

yi=  -p{0y\-9(t)yi. 

Hence, 

w'(t)  =y  i  [  -p{t)y2-q(t)y2]  ~yi[  ~p{t)y\ -  q(t)y\  ] 

=  -p{t)[yiy2-y\yi] 

=  -p{t)W(t).  □ 

We  can  now  give  a  very  simple  proof  of  Theorem  3. 

Proof  of  theorem  3.  Pick  any  t0  in  the  interval  (a,  ft).  From  Lemma  1, 

w[y\^2}{t)=  ^[7i»^2](^o)exp| -JT  p(j)rfA 

Now,  exp is  unequal  to  zero  for  a<t</3.  Therefore, 
is  either  identically  zero,  or  is  never  zero.  □ 
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The  simplest  situation  where  the  Wronskian  of  two  functions  yx(t),y2(t) 
vanishes  identically  is  when  one  of  the  functions  is  identically  zero.  More 
generally,  the  Wronskian  of  two  functions  y1(/),y2(0  vanishes  identically  if 
one  of  the  functions  is  a  constant  multiple  of  the  other.  If  y2=cyp  say> 
then 

-y  i  (wi)' -y\  (^1) =°- 

Conversely,  suppose  that  the  Wronskian  of  two  solutions  y\{t),y2{t)  of  (3) 
vanishes  identically.  Then,  one  of  these  solutions  must  be  a  constant  multi¬ 
ple  of  the  other,  as  we  now  show. 

Theorem  4.  Let  y  ft)  and  y2(t)  be  two  solutions  of  (3)  on  the  interval  a<t< 
f3 ,  and  suppose  that  W[ylyyf\(t^  —  0  for  some  t0  in  this  interval.  Then ,  one 
of  these  solutions  is  a  constant  multiple  of  the  other. 

Proof  #1.  Suppose  that  W[yx^y^{t^  =  Q.  Then,  the  equations 


C\y\{to)  +  c2y2(t0)  =  0 
c\y'\{to)^c2y'2{to)  =  0 

have  a  nontrivial  solution  c„e2;  that  is,  a  solution  c,,c2  with  c’il  +  |c2j 
Lely(t)=cxy  i(/)+  c2y2(0»  for  this  choice  of  constants  c,,c2.  We  know  that 
y(t)  is  a  solution  of  (3),  since  it  is  a  linear  combination  ofy^/)  andy2(/). 
Moreover,  by  construction,  y(f0)==0  andy'(7o)  =  0-  Therefore,  by  Theorem 
1  ,y(t)  is  identically  zero,  so  that 

C\ yx  (0  +  c2y2(  t)  =  0,  a<t<  (3, 

If  cx¥=  0,  then  >>!(/)=  —  {c2/ cx)y2(t),  and  if  c2  0,  theny2(/)  = 
—  (c i  /  cf)  y  ft)>  In  either  case,  one  of  these  solutions  is  a  constant  multiple 
of  the  other.  □ 

Proof  #2.  Suppose  that  W[yvy2](t^)  =  0.  Then,  by  Theorem  3, 
W[yvy2 ](0  is  identically  zero.  Assume  that  yft)y2{t)¥zQ  for  a<t</3. 
Then,  dividing  both  sides  of  the  equation 

y\  (0/2(0 -/i  (0^2(0”° 

*yy\(.t)y2(t)  sives 

y'i{t)  y\(t) 

yzO )  y\(‘) 

This  equation  implies  that  yft)  =  cy2(t)  for  some  constant  c. 

Next,  suppose  that  yft)y2(t)  is  zero  at  some  point  /  =  /*  in  the  interval 
a<t</3.  Without  loss  of  generality,  we  may  assume  that  y ft*)  =  0,  since 
otherwise  we  can  relabel  yx  andy2.  In  this  case  it  is  simple  to  show  (see  Ex¬ 
ercise  19)  that  either  yft)  =  Q,  or  y2{t)  =  [y2{t*)/y\(t*)\yft).  This  com¬ 
pletes  the  proof  of  Theorem  4.  □ 
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Definition.  The  functions  yx(t)  and  y2{t)  are  said  to  be  linearly  dependent 
on  an  interval  I  if  one  of  these  functions  is  a  constant  multiple  of  the 
other  on  /.  The  functions  yx(t)  andj>2(0  are  said  t0  be  linearly  indepen¬ 
dent  on  an  interval  I  if  they  are  not  linearly  dependent  on  /. 

Corollary  to  Theorem  4.  Two  solutions  yx{t)  and  y2(t)  of  (3)  are  linearly  in¬ 
dependent  on  the  interval  a<t<  fi  if,  and  only  if,  their  Wronskian  is  un¬ 
equal  to  zero  on  this  interval.  Thus,  two  solutions  yx(t)  and  y2(t)  form  a 
fundamental  set  of  solutions  of  (3)  on  the  interval  a  <  /  <  /?  if,  and  only  if, 
they  are  linearly  independent  on  this  interval. 

Exercises 

1.  Let  L[y](t)=y"(t)-3ty'(t)  +  3y(t).  Compute 

(a^L[e'],  (b)  L[cosV3  t],  (c)  L[ 2e'  +  4cosV3  /], 

(d)  L[t2],  (e)  L[5t2],  (f)  L[t\,  (g)  L[t2  +  3t]. 

2.  Let  L[y](t)=y"(t)-6y'(t)  +  5y(t).  Compute 

(a)  L[e‘],  (b)  Lie2'],  (c)  L[e3'],  (d)  L[erl], 

(e)  L[t],  (f)  L[t2},  (g)  L[/2  +  2/]. 

3.  Show  that  the  operator  L  defined  by 

L[^](t)=  fs2y(s)ds 
J  a 

is  linear;  that  is,  L[cy]  =  cL[y]  and  L[yx+y2]  =  L[yx]  +  L[y2], 

4.  Let  L[y](t)-y"(t)^-p(t)y'(t)  + q(t)y(t),  and  suppose  that  L[t2]=t+\  and 
L[t]=*2t  +  2.  Show  thaty(0=  t-2t2  is  a  solution  of  y”  +p(t)y'  +  q(t)y  =  0. 

5.  (a)  Show  that  >>!(/)=  v7  and  y2(0=  1  / 1  are  solutions  of  the  differential  equa¬ 

tion 


2t2y”  +  3ty'  —y  =  0  (*) 

on  the  interval  0  <  t  <  oo. 

(b)  Compute  W[yX9y2](t)'  What  happens  as  t  approaches  zero? 

(c)  Show  that  >^(0  and  y2(t)  form  a  fundamental  set  of  solutions  of  (*)  on  the 
interval  0<  t<  oo. 

(d)  Solve  the  initial-value  problem  2t2y"  +  3  ty'  —y  =  0;  y(l)=2,y'(l)s=  1- 

6.  (a)  Show  that  yft)  —  e~t2^2  and  y2(t)=s  e'*2/2  j^es2/2ds  are  solutions  of 


y"  +  ty'  +y  =  0  (*) 

on  the  interval  -  oo  <  t  <  oo. 

(b)  Compute  W[yx,y2](t). 

(c)  Show  thatyj  andy2  form  a  fundamental  set  of  solutions  of  (*)  on  the  inter¬ 
val  -  00  <  t<  00. 

(d)  Solve  the  initial- value  problem  yn  +  ty'  +y  =  0;  y (0)  =  0,  y'(O)  *  L 
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7.  Compute  the  Wronskian  of  the  following  pairs  of  functions. 

(a)  sin  at,  cos  bt  (b)  sin2 1 , 1  —  cos  2 1 

(c)  eat,ebt  (d)  eat,teat 

(e)  t,t\nt  (f)  eat  sin  bt,eat  cos  bt 

8.  Let  yx{t)  and  y2(t)  be  solutions  of  (3)  on  the  interval  —  oo  <  t<  oo  with,y1(0)  = 
^.vKO)*  1, >^2(^)  =  and ^2(0) =  Show  thaty^O  and>>2(0  are  linearly  depen¬ 
dent  on  the  interval  —  00  <  t  <00. 

9.  (a)  Let yx(t)  and y2(t)  be  solutions  of  (3)  on  the  interval  a<t<P,  withy1(/0)  = 

1,  =  yiOo)  =  0,  and  ^2(^0) =  1*  Show  that  yx(t)  and  y2{t)  form  a 

fundamental  set  of  solutions  of  (3)  on  the  interval  a<t</3. 

(b)  Show  th&t  y(t)=y0yl(t)+y'0y2(t)  is  the  solution  of  (3)  satisfying  y(t0)=y0 
and  /Oo)  =/)• 

10.  Show  that  y(t)  =  t2  can  never  be  a  solution  of  (3)  if  the  functions  pit)  and  q(t) 
are  continuous  at  t  =  0. 

11.  Lety1(/)=/2  and>>2(0= 

(a)  Show  that  and  j>2  are  linearly  dependent  on  the  interval  0<  t  <  1. 

(b)  Show  that^!  and  are  linearly  independent  on  the  interval  —  1  <  t  <  1. 

(c)  Show  that  W[yx,y^\{t)  is  identically  zero. 

(d)  Show  that^!  andy2  can  never  be  two  solutions  of  (3)  on  the  interval  —  1  <  t 
<  1  if  both  p  and  q  are  continuous  in  this  interval. 

12.  Suppose  that  and  y2  are  linearly  independent  on  an  interval  I.  Prove  that  zx 

=-y\+y2  and  z2=yx  —  y2  are  also  linearly  independent  on  /. 

13.  Let  yx  andy2  be  solutions  of  Bessel’s  equation 

t2y"  +  ty'  +  (t2-  n2)y  =  0 

on  the  interval  0<t<  oo,  with >^(1)=  l,y'1(l)  =  0, =  ^  andy2(l)  =  L  Com¬ 
pute  W[yhy2](t). 

14.  Suppose  that  the  Wronskian  of  any  two  solutions  of  (3)  is  constant  in  time. 
Prove  that  p(t)  =  0. 

In  Problems  15-18,  assume  that  p  and  q  are  continuous,  and  that  the  func¬ 
tions  y  x  and  j>2  are  solutions  of  the  differential  equation 

y"  +P(t)y'  +  q(t)y  =  0 


on  the  interval  a<t</3. 

15.  Prove  that  if  yx  and  y2  vanish  at  the  same  point  in  the  interval  a<t<  /?,  then 
they  cannot  form  a  fundamental  set  of  solutions  on  this  interval. 

16.  Prove  that  if  yx  andy2  achieve  a  maximum  or  minimum  at  the  same  point  in 
the  interval  a<t</3 ,  then  they  cannot  form  a  fundamental  set  of  solutions  on 
this  interval. 

17.  Prove  that  if  yx  and  y2  are  a  fundamental  set  of  solutions,  then  they  cannot 
have  a  common  point  of  inflection  in  a  <  t  <  ft  unless  p  and  q  vanish  simulta¬ 
neously  there. 
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18.  Suppose  thatjq  andy2  are  a  fundamental  set  of  solutions  on  the  interval  —  00 
<t<  00.  Show  that  there  is  one  and  only  one  zero  of  yx  between  consecutive 
zeros  ofj>2-  Hint:  Differentiate  the  quantity  y^/yx  and  use  Rolle’s  Theorem. 

19.  Suppose  that  Wly^y^t^^O,  and,  in  addition,  y!(^*)  =  0.  Prove  that  either 
>>i(/)  =  0  ox  y2{i)  =  [y'2(t*)/y'x(t*)\yx(t).  Hint:  If  W[yl9y2](t*)  =  0  andy1(/*)  =  0, 
then  y2(t*)y\(t*)  =  0. 


2.2  Linear  equations  with  constant  coefficients 

We  consider  now  the  homogeneous  linear  second-order  equation  with  con¬ 
stant  coefficients 


L[y]=a^+b^+cy=0  0) 

where  a ,  b ,  and  c  are  constants,  with  0.  Theorem  2  of  Section  2.1  tells 
us  that  we  need  only  find  two  independent  solutions^  and >>2  °f  0); 
other  solutions  of  (1)  are  then  obtained  by  taking  linear  combinations  of  yx 
andy2.  Unfortunately,  Theorem  2  doesn’t  tell  us  how  to  find  two  solutions 
of  (1).  Therefore,  we  will  try  an  educated  guess.  To  this  end,  observe  that  a 
function  y(t)  is  a  solution  of  (1)  if  a  constant  times  its  second  derivative, 
plus  another  constant  times  its  first  derivative,  plus  a  third  constant  times 
itself  is  identically  zero.  In  other  words,  the  three  terms  ay",  by',  and  cy 
must  cancel  each  other.  In  general,  this  can  only  occur  if  the  three  func¬ 
tions  y(t),  y'(t),  and>>"(0  are  of  the  “same  type”.  For  example,  the  func¬ 
tion  y(t)=t5  can  never  be  a  solution  of  (1)  since  the  three  terms  20 at3, 
5bt4,  and  ct5  are  polynomials  in  t  of  different  degree,  and  therefore  cannot 
cancel  each  other.  On  the  other  hand,  the  function  y{t)  —  ert,  r  constant, 
has  the  property  that  both>>'(0  andy"(0  are  multiples  of  y{t).  This  sug¬ 
gests  that  we  try  y(t)  =  ert  as  a  solution  of  (1).  Computing 

L[ert]  =  a{ert)"  +  b{ert)'  +  c(ert) 

=  (ar2+  br+  c)ert, 

we  see  that  y{t)  —  ert  is  a  solution  of  (1)  if,  and  only  if 

ar 2  +  ftr  +  c  =  0.  (2) 

Equation  (2)  is  called  the  characteristic  equation  of  (1).  It  has  two  roots 
rvr2  given  by  the  quadratic  formula 

—  b+^/b2  —  4ac  _  —b—\Jb2  —  4  ac 

ri“  2  a  >  r2~  2a 

If  b2-4ac  is  positive,  then  rx  and  r2  are  real  and  distinct.  In  this  case,^/) 
=  er,/  and  y2(t)=:  ^  are  two  distinct  solutions  of  (1).  These  solutions  are 
clearly  linearly  independent  (on  any  interval  I),  since  erit  is  obviously  not 
a  constant  multiple  of  erit  for  r2^rv  (If  the  reader  is  unconvinced  of  this 
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he  can  compute 

W[  e r»',  e r*  ]  =  ( r2 - r  x  )e(r' + 

and  observe  that  W  is  never  zero.  Hence,  erit  and  erjt  are  linearly  indepen¬ 
dent  on  any  interval  /.) 


Example  1.  Find  the  general  solution  of  the  equation 


dy 

+  5f+4,  =  0. 


Solution.  The  characteristic  equation  r2  +  5r  +  4  =  (r  +  4)(r+  1)  =  0  has  two 
distinct  roots  rx  =  —  4  and  r2  =  —  1 .  Thus, y x ( t)  =  e ~ 4/  and y2( t)  =  e~*  form  a 
fundamental  set  of  solutions  of  (3),  and  every  solution  y(t)  of  (3)  is  of  the 
form 


y(t)  =  cxe  4,+  c2e  ‘ 
for  some  choice  of  constants  cvc2. 


Example  2.  Find  the  solution  y(t)  of  the  initial-value  problem 

d2y  dy 

-Z+4J--2y=0-,  y( 0)  =  1,  /( 0)  =  2. 

Solution.  The  characteristic  equation  r2  +  4r  —  2  =  0  has  2  roots 


and 


-4+  VT6  +  8 


-2+V6 


-4-V16  +  8  =_2_V6  . 

2 

Hence,  yx(t)  =  erit  and  y2(t)  =  erit  are  a  fundamental  set  of  solutions  of 
y  "  +  4y '  —  2y  =  0,  so  that 

y(t)  =  c1e(~2'h^)'+  c2e(~2~v^)t 

for  some  choice  of  constants  cx,c2.  The  constants  cx  and  c2  are  determined 
from  the  initial  conditions 

c1  +  c2=l  and  (  —  2+  V6  )cj  +  (  —  2—  V6  )c2  =  2. 

From  the  first  equation,  c2=l-c1.  Substituting  this  value  of  c2  into  the 
second  equation  gives 

(  —  2+  V6  )cx  —  (2+  V6  )(1  —  cx)  =  2,  or  2 V6  cx  =  4+  V6  . 
Therefore,  cx  =2/ \f6  +  c2=  l  —  cx  =  ^  —  2/V6  ,  and 
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Exercises 


Find  the  general  solution  of  each  of  the  following  equations. 

d2y  „  dy 


d2y 

1.  -4-y  =  0 


3. 


dr 

d2y  .  dy 
— 7  ~  3-1-  +  7  =  0 
dt 2  dt 


2.  6’~-l^-+y  =  d 

dt 2  <# 

4.  3  — —  +  6  -77  +  27  —  0 
<ft2  dt 


Solve  each  of  the  following  initial-value  problems. 
5-  ^(0)=h/(0)=0 

6.  2^  +  J-10y  =  0;  >>(1)  =  5,/(1)  =  2 

dy 

7-  5-^+5-^-7  =  °;  7(0)  =  0,7/(0)=l 


dt 


_  flf27  <#> 

8-  ^_65+>,=0: 


Remark.  In  doing  Problems  6  and  8,  observe  that  er('"'o)  is  also  a  solution 
of  the  differential  equation  ay"  +  by' +  cy  =  0  if  ar2  +  6r  +  c  =  0.  Thus,  to 
find  the  solution y(t)  of  the  initial-value  problem  ay"  +  by' +  cy  =  0;  7(/0)  = 
y0,y'(t0)=y'0,  we  would  write  y{t)  —  cleri{t~to)  +  c2er2^~to)  and  solve  for  c, 
and  c2  from  the  initial  conditions. 

9.  Let  7(0  be  the  solution  of  the  initial-value  problem 

+5^  +6y  =  °;  y(0)  =  1,  y'(0)=V. 

For  what  values  of  V  does  y(t)  remain  nonnegative  for  all  t  >0? 

10.  The  differential  equation 


L  [y]  =  t2y"  +  aty'  +  0y  =  0  (*) 

is  known  as  Euler’s  equation.  Observe  that  t2y ",  ty\  and  y  are  all  multiples  of 
tr  if  7  =  tr.  This  suggests  that  we  try  y  =  tr  as  a  solution  of  (*).  Show  that y  »  tr 
is  a  solution  of  (*)  if  r2  +  (a-  l)r  +  /?  =  0. 

11.  Find  the  general  solution  of  the  equation 

t2y"  +  5ty'  -57  =  0,  t>  0 

12.  Solve  the  initial-value  problem 

fV-fy'-2y=0;  ^(1)  =  0,  /(1)=1 

on  the  interval  0 <  t  <  oo. 
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2.2.1  Complex  roots 

If  b2  —  4ac  is  negative,  then  the  characteristic  equation  ar2+br+c  =  0  has 
complex  roots 


—  b  +  /V 4ac  —  b2  A  —b  —  i^j4ac  —  b2 

r i  = - - -  and  r~>  = - 


2a  ““  7  2  2a 

We  would  like  to  say  that  erxt  and  erit  are  solutions  of  the  differential 
equation 


d2y  ^udy  ^  n 

a — -  +b  —  +  cy  =0. 
dt2  dt 

However,  this  presents  us  with  two  serious  difficulties.  On  the  one  hand, 
the  function  ert  is  not  defined,  as  yet,  for  r  complex.  And  on  the  other 
hand,  even  if  we  succeed  in  defining  er{t  and  erit  as  complex-valued  solu¬ 
tions  of  (1),  we  are  still  faced  with  the  problem  of  finding  two  real-valued 
solutions  of  (1). 

We  begin  by  resolving  the  second  difficulty,  since  otherwise  there’s  no 
sense  tackling  the  first  problem.  Assume  that  y(t)=  u(t)+  iv(t)  is  a  com¬ 
plex-valued  solution  of  (1).  This  means,  of  course,  that 

a[  u"(t)  +  iv"{t)  ]  +  b[uft)  +  iv'(t)  ]  +  c[  u(t)  +  iv(t)]  =  0.  (2) 

This  complex-valued  solution  of  (1)  gives  rise  to  two  real- valued  solutions, 
as  we  now  show. 


Lemma  1.  Let  y(t)  =  u(t)  +  iv(t)  be  a  complex-valued  solution  of  ( 1),  with  a, 
b ,  and  c  real.  Then,  yx(t)=  u(t)  and  y2(t)  =  v(t)  are  two  real-valued  solu¬ 
tions  of  (1).  In  other  words ,  both  the  real  and  imaginary  parts  of  a  com¬ 
plex-valued  solution  of  (1)  are  solutions  of  (1).  ( The  imaginary  part  of  the 
complex  number  a  +  i/3  is  (3.  Similarly ,  the  imaginary  part  of  the  function 
u(t)+iv(t)  is  v(t).) 

Proof.  From  Equation  (2), 

[au"(t)  +  bu\t)  +  cu(t)]  +  i[av"(t)  +  bv'(t)  +  cv(t)]  =0.  (3) 

Now,  if  a  complex  number  is  zero,  then  both  its  real  and  imaginary  parts 
must  be  zero.  Consequently, 

au"(t)  +  bu'(t)  +  cu(t)  =  0  and  av"(t)  +  bv'(t)  +  cv(t)  =  0, 
and  this  proves  Lemma  1 .  □ 

The  problem  of  defining  ert  for  r  complex  can  also  be  resolved  quite 
easily.  Let  r  =  a  +  i/3.  By  the  law  of  exponents, 

ert  =  eatenst .  (4) 

Thus,  we  need  only  define  the  quantity  for  /3  real.  To  this  end,  recall 
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that 


x 3 

e*=l  +  x+|j-  +  |j-  +  .... 


(5) 


Equation  (5)  makes  sense,  formally,  even  for  x  complex.  This  suggests  that 
we  set 


(ipt)2  (ipt)3 

eipt=\  +  ipt+—J-  +  —=r-  +  .... 


2! 


3! 


Next,  observe  that 


{ifitf  p2t2  ip3t3  pY  //?¥ 

\  +  ipl+-^-  +  ...^l  +  ipt--^. - ^  +  +  + 


5,5 


2! 


2!  3! 

P2t2  pY 

1 - - —  H - — —  +  . . . 


4! 


5! 


+  i 


fi3t3  (i5t5 
fit  Ti  ^  _ 5^  ^ 


2!  '  4!  '  "•  I  '  'I  1  3! 

=  cos/?/  +  i  sin  fit. 

Hence, 

e(a  +  ip)t  =  eatei/3t  _  £at  (cos  fa  +  /  sin/?/).  (6) 

Returning  to  the  differential  equation  (1),  we  see  that 
y  (t)  =  el~  b  +  4aC~  t2  J'/2" 

=  e~ bt/2a [ cos  V 4ac  —  b 2  t / 2a  +  i sin  V 4ac  —  b2  t/2a^ 

is  a  complex-valued  solution  of  (1)  if  b2  —  4ac  is  negative.  Therefore,  by 
Lemma  1, 


yx(t)  =  e  bt/2acos fit  and  y2{t)  =  e  ^2crsin fit,  /? = 


V4  ac  —  b2 
2  a 


are  two  real-valued  solutions  of  (1).  These  two  functions  are  linearly  inde¬ 
pendent  on  any  interval  /,  since  their  Wronskian  (see  Exercise  10)  is  never 
zero.  Consequently,  the  general  solution  of  (1)  for  b2  —  4ac<0  is 


y{t)  —  e  bt/2a[clcosfit+  c2sin/?/],  /?  = 


V4ac-b2 
2  a 


Remark  1.  Strictly  speaking,  we  must  verify  that  the  formula 


vi  fl  ft 

—re  —  re 


dt 

is  true  even  for  r  complex,  before  we  can  assert  that  er '*  and  erit  are  com¬ 
plex-valued  solutions  of  (1).  To  this  end,  we  compute 

A-e(<*  +  iP)t _  jLe<xt[cosfit+  /sin/?/] 

=  eat  [  (a  cos  /?/  —  /?  sin  /?/)  +  i{a  sin  fit  +  /?  cos  /?/)  ] 
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and  this  equals  (a  +  ii 6)e(a  +  ,/3)',  since 

(a  +  i(i  +  =  («  +  //}  )ea'[cos/fr  +  /  sin/?/] 

=  e  at  [  (  a  cos  fit  —  /?  sin  fit)  +  /  ( a  sin  /it  4-  /?  cos  /?/)  ] . 
Thus,  ( d/dt)ert  =  rert ,  even  for  r  complex. 

Remark  2.  At  first  glance,  one  might  think  that  erit  would  give  rise  to  two 
additional  solutions  of  (1).  This  is  not  the  case,  though,  since 

er*  =  e~ (b/2a)‘e ~ i/3‘,  (3  =  V4 ac-b2  /2 a 

=  e_ft'/2o[cos(  —  fit)  +  /sin(  —  /Jf)]  =  e_*'/2a[cos)S/  —  /  sin  ySf  ] . 

Hence, 

Re{er2' }  =  e~bt/2acosfit  =>>,(/) 

and 

\m{er2t }  =  —  e  ~  bt^2a  sin  fit  =  — y2(0- 

Example  1.  Find  two  linearly  independent  real-valued  solutions  of  the  dif¬ 
ferential  equation 

.  d2y  dy 

4^+4f+^-°-  <7> 

Solution.  The  characteristic  equation  4r2  +  4r  +  5  =  0  has  complex  roots  rx 
=  —  \  +  /  and  r2  =  —\  —  i.  Consequently, 

er\f  =  e(-1/2+0t=  e~*/2cost  +  ie~^2sint 
is  a  complex-valued  solution  of  (7).  Therefore,  by  Lemma  1, 

Re{e'v  }  =  e_'/2cos t  and  \m{eryt }  =  e~t/2s\nt 

are  two  linearly  independent  real-valued  solutions  of  (7). 

Example  2.  Find  the  solution  y{t)  of  the  initial-value  problem 

d2y  dy 

-_Z+2^+4y=0;  y(0)=l,  /(0)-l. 

Solution.  The  characteristic  equation  r2  +  2r  +  4  =  0  has  complex  roots 
—  1  +  V3  i  and  r2  =  —  1  —  V3  i.  Hence, 

erit  =  e(~l  +  y^ l^t=e~tcos  V3  /  +  ie~‘sin  V3  t 
is  a  complex-valued  solution  of  y"  +  2y'  +  4y  =  0.  Therefore,  by  Lemma  1, 
both 

Re{eri' }  =  e-/cos  V3  t  and  Im{eri' }  =  e~'sin  V3  t 
are  real-valued  solutions.  Consequently, 

y(t)  =  e~t[cl cos  V3  /  +  c2sin  V3  /] 
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for  some  choice  of  constants  cvc2.  The  constants  cx  and  c2  are  determined 
from  the  initial  conditions 


and 


1  =>’(0)  =  c 


1  =>''(0)  =  —  c,  +  V3  c2. 


This  implies  that 
c  i =  hc2  = 


V3 


and  y(t)  =  e  ' 


2 

cos  VI  t  H — —  sin  V3  t 

VI 


Exercises 


Find  the  general  solution  of  each  of  the  following  equations. 

d2y 


.  d2y  dy 

i.  -4  +  4+>-=o 

dt 2  dt 

,  d2y  ,.dy 
3.  — 7+2~T-+3y  =  0 
dt 2  dt 


2.  2 — i-  +3-4-  +4y  =  0 


dy_ 

dt 


4.  4 


dt 2 
</2y  A' 

— -■  — -j-  +  y  =  0 

dt2  dt 


Solve  each  of  the  following  initial-value  problems. 
+  0;  y(0)=l,/(0)=-2 


6.  ^+2^+5y  =  0;  ^(0)=0, /(0)  =  2 

7.  Assume  that  b2  —  4ac  <  0.  Show  that 

y  i  (0  =  e(  ~  “ /o)  cos  /3(t  — 10) 

and 


^2(0  =  e<_  b/2a){'  _<o)sin/3  (t- 10),  p  =  — 4a^~& 

are  solutions  of  (1),  for  any  number  t0. 

Solve  each  of  the  following  initial-value  problems. 

8.  2^£  — ^+3y=0;  /(1)=1,  /(1)=1 

9.  3^ -2  J  +4y=0;  /(2)=1, /(2)= -1 

10.  Verify  that  JF[ea/cos/fr,  ea,sin/fr]  =  /fe2a/. 

11.  Show  that  eiut  is  a  complex-valued  solution  of  the  differential  equation  y"  + 
co2y~  0.  Find  two  real-valued  solutions. 

12.  Show  that  (cos  t  +  i  sin  t)r  =  cos  rt  +  /  sin  rt.  Use  this  result  to  obtain  the  double 
angle  formulas  sin  It =2  sin  /cos/  and  cos  It  =  cos2  /-sin2/. 
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13.  Show  that 

(cos  tx  +  /sin/,)(cosZ2  +  /  sin  t2)  =  cos(tx  +  /2)  +  zsin^j  +  Z2). 

Use  this  result  to  obtain  the  trigonometric  identities 

cos(  1 1  + 1 2 )  =  cos  t  j  cos  t2  —  sin  /  j  sin  z2, 
sin(/j  4-  /2)  =  sin/1cos/2  +  cos/,sin/2. 

14.  Show  that  any  complex  number  a  +  ib  can  be  written  in  the  form  Ae 10 ,  where 
A  =  Va2  +  b2  and  tan0  =  b/a. 

15.  Defining  the  two  possible  square  roots  of  a  complex  number  Aeld  as 
±  VA  ei0/f 2,  compute  the  square  roots  of  z,  1  +  z,  -  z,  V7. 

16.  Use  Problem  14  to  find  the  three  cube  roots  of  z. 

17.  (a)  Let  rx  =  A  +  //x  be  a  complex  root  of  r2  +  (a  -  l)r  +  /?  =  0.  Show  that 

tx +,'/A  =  =  /A[cos  n  In  t  +  z  sin  /x  In  /] 


is  a  complex-valued  solution  of  Euler’s  equation 


»_0. 

dt 2  A 


(b)  Show  that  zAcosjuln/  and  zA  sin /a  In/  are  real-valued  solutions  of  (*). 
Find  the  general  solution  of  each  of  the  following  equations. 


18.  t 


~>d2y  dy 

-+t-+y^  ,>0 


19.  t2 


+  2t 


<fy_ 

dt 


+  2y  =  0, 


/  >0 


2.2.2  Equal  roots ;  reduction  of  order 

If  b2  =  4ac,  then  the  characteristic  equation  ar2  +  br+c  =  0  has  real  equal 
roots  rx  =  r2  =  —b/2a.  In  this  case,  we  obtain  only  one  solution 

yx  (/)  =  e~bt/2a 

of  the  differential  equation 

d2y  dy  .  . 

a — -+b—+cy  =  0.  (1) 

dt2  dt  7  w 

Our  problem  is  to  find  a  second  solution  which  is  independent  of  yx.  One 
approach  to  this  problem  is  to  try  some  additional  guesses.  A  second,  and 
much  more  clever  approach  is  to  try  and  use  our  knowledge  of  >>,(/)  to  help 
us  find  a  second  independent  solution.  More  generally,  suppose  that  we 
know  one  solution  y=yx(t)  of  the  second-order  linear  equation 

.  d2y  ,  fa 

L[y]^—^+p{t)-^  +  q(t)y=a.  (2) 

Can  we  use  this  solution  to  help  us  find  a  second  independent  solution? 
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The  answer  to  this  question  is  yes.  Once  we  find  one  solution  y=yx(t)  of 
(2),  we  can  reduce  the  problem  of  finding  all  solutions  of  (2)  to  that  of 
solving  a  first-order  linear  homogeneous  equation.  This  is  accomplished  by 
defining  a  new  dependent  variable  v  through  the  substitution 

Then 

dy  _  dyl  dv 

dt  V  dt  dt 

and 


d2y  d2yx  dv  dy  \  d2 v 

v — —  +2—  — —  +y  i 


dt 


dt 2 


dt  dt 


Consequently. 


rr  i  t^dv^y i  ,  d2 v  , 

L[y]=v—  +2-x  —  +*— +p(Q 


=y  i 


=y  i 


dt 2 

l/2t> 

A2 

J2t> 


dt  '  1  dV 
dyx 

+  \2~df+p(t)yi 

dy , 

+  I2-^-+p(07, 


6to 


+ 


V 
' 2 , 


rf/2 


+Jl  dt 


+  ?(0«Vi 


+/»(0-JT +9(0^1 


dv 


dt 


V 


dv 
dt  9 


since  yx(t)  is  a  solution  of  L[y]  =  0.  Hence,  y(t)=yx(t)v(t)  is  a  solution  of 
(2)  if  t;  satisfies  the  differential  equation 


y\ 


+ 


dy  i 

2  ~dF+P{t)y' 


^=o. 

dt 


Now,  observe  that  Equation  (3)  is  really  a  first-order  linear  equation  for 
dv/ dt.  Its  solution  is 


dv 

dt 


r 

y',(t)  1 

cexp 

"/ 

1 

+ 

<N 

_ 1 

dt 

> 

Since  we  only  need  one  solution  v(t)  of  (3),  we  set  c=  1  in  (4).  Integrating 
this  equation  with  respect  to  t ,  and  setting  the  constant  of  integration  equal 
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to  zero,  we  obtain  that  v 


(0  =  j u(t)dt ,  where 


Hence, 


y2(t)  =  v(t)yl{t)=yl(t)  j  u(t)dt  (6) 

is  a  second  solution  of  (2).  This  solution  is  independent  of  yv  for  ify2(0 
were  a  constant  multiple  ofy^f)  then  v(t)  would  be  constant,  and  conse¬ 
quently,  its  derivative  would  vanish  identically.  However,  from  (4) 

exp 
dv  _ 

dt  y\il) 


and  this  quantity  is  never  zero. 

Remark  1.  In  writing  v{t)  =  J  u(t)dt ,  we  set  the  constant  of  integration 

equal  to  zero.  Choosing  a  nonzero  constant  of  integration  would  only  add 
a  constant  multiple  of  >^(7)  to  y2(t).  Similarly,  the  effect  of  choosing  a  con¬ 
stant  c  other  than  one  in  Equation  (4)  would  be  to  multiply  y2(t)  by  c. 


Remark  2.  The  method  we  have  just  presented  for  solving  Equation  (2)  is 
known  as  the  method  of  reduction  of  order,  since  the  substitution  y(7)  = 
y\(t)v(t)  reduces  the  problem  of  solving  the  second-order  equation  (2)  to 
that  of  solving  a  first-order  equation. 


Application  to  the  case  of  equal  roots :  In  the  case  of  equal  roots,  we  found 
yft)  —  e~bt^2a  as  one  solution  of  the  equation 


a 


+  cy  =  0. 


We  can  find  a  second  solution  from  Equations  (5)  and  (6).  It  is  important 
to  realize  though,  that  Equations  (5)  and  (6)  were  derived  under  the 
assumption  that  our  differential  equation  was  written  in  the  form 


that  is,  the  coefficient  of  y"  was  one.  In  our  equation,  the  coefficient  of  y" 
is  a .  Hence,  we  must  divide  Equation  (7)  by  a  to  obtain  the  equivalent 
equation 

d2y  ,b4y  c 

— -  H - —  +  —  y  =0. 

dt 2  a  dt  a 
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Now,  we  can  insert  p(t)  =  b/ a  into  (5)  to  obtain  that 


Hence, 

yi{t)=yi{t)  f  dt=tyx(t) 

is  a  second  solution  of  (7).  The  functions  yx(t)  and  y2(t)  are  clearly  linearly 
independent  on  the  interval  —  oo  <t<oo.  Therefore,  the  general  solution 
of  (7)  in  the  case  of  equal  roots  is 

y(t)  =  c{e~bt/2a  +  c2te~bt/la  =  [c{  +  c2t]e~bt/2a 

Example  1.  Find  the  solution  y(t)  of  the  initial- value  problem 

^j+4^+4y  =  0;  .y(0)  =  l,  /(0)  =  3. 

Solution .  The  characteristic  equation  r2  +  4r  +  4  =  (r  +  2)2  =  0  has  two  equal 
roots  r{  =  r2  =  —  2.  Hence, 

y  (t)  =  cxe~2t  +  c2te~2t 

for  some  choice  of  constants  cv  c2.  The  constants  cx  and  c2  are  determined 
from  the  initial  conditions 

l=^(0)  =  c1 

and 

3=/(°)=  “2  C\  +  c2. 

This  implies  that  cY  =  1  and  c2  =  5,  so  that  y{t)  =  {\  +  5t)e~2t. 

Example  2.  Find  the  solution  y{t)  of  the  initial-value  problem 

{l~t2)^+lt^~2y=0;  y{0)=3,  y'W=~4 

on  the  interval  —  1  <  /  <  1. 

Solution .  Clearly,  yx(i)  =  t  is  one  solution  of  the  differential  equation 

We  will  use  the  method  of  reduction  of  order  to  find  a  second  solution 
y2(t)  of  (8).  To  this  end,  divide  both  sides  of  (8)  by  1  —  t2  to  obtain  the 
equivalent  equation 

d2y  .  2t  dy  2  _n 

dt 2  l-t2  dt  \-tiy 
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Then,  from  (5) 


and 

yi(t)  =  tf  dt=  -  =  ~(\  + 12) 

is  a  second  solution  of  (8).  Therefore, 

y(t)  =  clt-c2(l  +  t2) 

for  some  choice  of  constants  cv  c2.  (Notice  that  all  solutions  of  (9)  are  con¬ 
tinuous  at  t  =  ±  1  even  though  the  differential  equation  is  not  defined  at 
these  points.  Thus,  it  does  not  necessarily  follow  that  the  solutions  of  a  dif¬ 
ferential  equation  are  discontinuous  at  a  point  where  the  differential  equa¬ 
tion  is  not  defined — but  this  is  often  the  case.)  The  constants  cx  and  c2  are 
determined  from  the  initial  conditions 

3=^(0)=  -  c2  and  -4=y'(0)  =  cv 

Hence,  7(0=  -4f  +  3(l  +  /2). 


Exercises 


Find  the  general  solution  of  each  of  the  following  equations 

d2y 


^  _4+^.o 

dt2  dt  y 


2-4^-12-+9'“° 


dt 


Solve  each  of  the  following  initial-value  problems. 

2 

3.  9^+6j+>>  =  0;  >>(0)=l,/(0)=0 

4'  4$"4^+>,  =  0;  ^°)=0’  /(°)  =  3 

5.  Suppose  b2=4ac.  Show  that 

yl(t)  =  e~b(t~t°^2a  and  y2(t)  =  (t  —  t0)e~b^~to^2a 
are  solutions  of  (1)  for  every  choice  of  t0. 

Solve  the  following  initial-value  problems. 

6.  ?Z+2^+y  =  0-,  y(2)  —  1,  y'(2)=  —  1 

7.  9^-12  J+4^=0;  y(n)=0,  y'(n)  =  2 

8.  Let  a ,  b  and  c  be  positive  numbers.  Prove  that  every  solution  of  the  differential 
equation  ay"  +  by'  +  cy=0  approaches  zero  as  t  approaches  infinity. 
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9.  Here  is  an  alternate  and  very  elegant  way  of  finding  a  second  solution  y2(t)  of 

(1). 

(a)  Assume  that  b2  =  4ac.  Show  that 

L[ert]  =  a(ert)"  +  b(ert)'  +  cert  =  a(r  —  rx)2ert 

for  rx  =  —  b/2a. 

(b)  Show  that 

(d/ dr)L[ert]  =  L[(3/3r)er/]  =  L[tert]  =  2a(r  —  rx)ert  +  at(r—  rx)2ert. 

(c)  Conclude  from  (a)  and  (b)  that  L[teri']  =  0.  Hence,  y2(0  =  is  a  second 
solution  of  (1). 

Use  the  method  of  reduction  of  order  to  find  the  general  solution  of  the 
following  differential  equations. 


i2l_  2«  +  1)  » 


dV 


(t2+2t—  1)  dt  (t2+2t—l) 


y  = o  (>>i(0=/+i) 


11.  ^  -4t^  +(4t2-2)y  =  0  (yx(t)  =  e‘2) 


, 2 , d y  ~.dy 


\2.{\-t2)^-2t-^+2y  =  0  O'.(O-r) 


dt 2 


,*dy 


13.  (l  +  t*)^j-2t-^+2y=0  (y}(t)=t) 


dv 


14.  (1  - 12) 


2'd2y -2t^+6y  =  0  (.y,(0=3<2-l) 


dv 
d2y 


dy 


15.  (2/  +  l)-^-4(/+l)-^+4y  =  0  (^(/)-/+l) 
^  jd2y  dy 

16't^+td^+ 


('’4H  H-f) 


17.  Given  that  the  equation 

t^~(l+3t)^+3y=0 

has  a  solution  of  the  form  ect,  for  some  constant  c,  find  the  general  solution. 

18.  (a)  Show  that  tr  is  a  solution  of  Euler’s  equation 

t2y"  +  aty' +  fiy  =0,  t>  0 
if  r2  4-  (a  —  l)r  +  ft  =  0. 

(b)  Suppose  that  (a  —  l)2  =  4/3.  Using  the  method  of  reduction  of  order,  show 
that  (In  *)/(1-a)/2  is  a  second  solution  of  Euler’s  equation. 

Find  the  general  solution  of  each  of  the  following  equations. 


19.  t 


2 d  y  . 


+3/-T7  +y =0 
dt2  dt  * 


20.  / 


0  d2y  dy 
2  -4-t-4;+y=0 
dt2  dt 
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2.3  The  nonhomogeneous  equation 

We  turn  our  attention  now  to  the  nonhomogeneous  equation 

2 

where  the  functions  /?(/),  #(/)  and  g(t)  are  continuous  on  an  open  interval 
a<t</i.  An  important  clue  as  to  the  nature  of  all  solutions  of  (1)  is  pro¬ 
vided  by  the  first-order  linear  equation 

~dt  —ity~  ~l-  (2) 

The  general  solution  of  this  equation  is 

y{t)  =  ce‘2  +  j. 

Now,  observe  that  this  solution  is  the  sum  of  two  terms:  the  first  term,  cet , 
is  the  general  solution  of  the  homogeneous  equation 

|-2<v-0  (3) 

while  the  second  term,  |,  is  a  solution  of  the  nonhomogeneous  equation 
(2).  In  other  words,  every  solution  y  (/)  of  (2)  is  the  sum  of  a  particular 
solution,  i/s(t)  =  b  a  solution  ce*  of  the  homogeneous  equation.  A  sim¬ 
ilar  situation  prevails  in  the  case  of  second-order  equations,  as  we  now 
show. 

Theorem  5.  Let  yft)  and  y2(t)  be  two  linearly  independent  solutions  of  the 
homogeneous  equation 

L[y]  =  ^Z+p(t)^+q(t)y=0  (4) 

and  let  \j/(t)  be  any  particular  solution  of  the  nonhomogeneous  equation  (1). 
Then ,  every  solution  y{t)  of  (1)  must  be  of  the  form 

y{t)  =  ciyx{t)  +  c2y2{t)  +  ^{t) 

for  some  choice  of  constants  cv  c2. 

The  proof  of  Theorem  5  relies  heavily  on  the  following  lemma. 

Lemma  1.  The  difference  of  any  two  solutions  of  the  nonhomogeneous  equa¬ 
tion  (1)  is  a  solution  of  the  homogeneous  equation  (4). 

Proof.  Let  ^ft)  and  \p2(t)  be  two  solutions  of  (1).  By  the  linearity  of  L, 
Hence,  *Pi(t)-ip2(t)  is  a  solution  of  the  homogeneous  equation  (4).  □ 
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We  can  now  give  a  very  simple  proof  of  Theorem  5. 

Proof  of  theorem  5.  Let  y{t)  be  any  solution  of  (1).  By  Lemma  1,  the 
function  <t>(t)=y(t)  —  \l>(t)  is  a  solution  of  the  homogeneous  equation  (4). 
But  every  solution  <f>(t)  of  the  homogeneous  equation  (4)  is  of  the  form 
<t>(t)  =  cxy  x(t)  +  c2y2(t),  for  some  choice  of  constants  cx,  c2.  Therefore, 

y(0=<K0 + HO =cij'i(0+ W2(0 + HO-  □ 


Remark.  Theorem  5  is  an  extremely  useful  theorem  since  it  reduces  the 
problem  of  finding  all  solutions  of  (1)  to  the  much  simpler  problem  of  find¬ 
ing  just  two  solutions  of  the  homogeneous  equation  (4),  and  one  solution 
of  the  nonhomogeneous  equation  (1). 


Example  1.  Find  the  general  solution  of  the  equation 


+y  =  t. 


Solution .  The  functions  yx{t)  =  cost  and  y2(t)  —  sint  are  two  linearly  inde¬ 
pendent  solutions  of  the  homogeneous  equation  y"  +y  =  0.  Moreover,  ^/(/) 
=  t  is  obviously  a  particular  solution  of  (5).  Therefore,  by  Theorem  5, 
every  solution  y{t)  of  (5)  must  be  of  the  form 

y(t)  —  cxcost  +  c2sin/  + 


Example  2.  Three  solutions  of  a  certain  second-order  nonhomogeneous 
linear  equation  are 

=  ^  =  t  + and  *p3(t)=  l  +  /  +  e'. 

Find  the  general  solution  of  this  equation. 

Solution,  By  Lemma  1,  the  functions 

HiO-t i(0  =  e'  and  f3(t)~ >M0  =  1 

are  solutions  of  the  corresponding  homogeneous  equation.  Moreover,  these 
functions  are  obviously  linearly  independent.  Therefore,  by  Theorem  5, 
every  solution  y(t)  of  this  equation  must  be  of  the  form 

y(t)  =  clet  +  c2+  t. 


Exercises 

1.  Three  solutions  of  a  certain  second-order  nonhomogeneous  linear  equation  arc 

\pi(t)  =  t2,  x//2(t)  =  t2  +  e2t 
and 

\p3(t)=  1  + 12  +  2e2t. 

Find  the  general  solution  of  this  equation. 
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2.  Three  solutions  of  a  certain  second-order  linear  nonhomogeneous  equation  are 

ip,(t)  =  1  +  e'\  \p2(t)  =\  +  te‘2 

and 

*3(0-(*+  iv2+i 

Find  the  general  solution  of  this  equation. 

3.  Three  solutions  of  a  second-order  linear  equation  L[y]  =  g(t)  are 

\pl(t)  =  3e*  +  e‘2,  +  e*2 

and 

i//3  ( /)  =  5  e '  +  e  ~  '3  -I-  e '2. 

Find  the  solution  of  the  initial- value  problem 

L[y]  =  g'>  y(0)=i,  /( 0)=2. 

4.  Let  a ,  Z?  and  c  be  positive  constants.  Show  that  the  difference  of  any  two  solu¬ 
tions  of  the  equation 

ay"  +  by'  +  cy  =  g(t) 

approaches  zero  as  t  approaches  infinity. 

5.  Let  be  a  solution  of  the  nonhomogeneous  equation  (1),  and  let  <j>(t)  be  a 
solution  of  the  homogeneous  equation  (4).  Show  that  *//(/)  +  </>(/)  is  again  a  solu¬ 
tion  of  (1). 

2.4  The  method  of  variation  of  parameters 

In  this  section  we  describe  a  very  general  method  for  finding  a  particular 
solution  \p(t)  of  the  nonhomogeneous  equation 

L[y]  =  -^+p(0-^+<i(0y=g((),  (i) 

once  the  solutions  of  the  homogeneous  equation 

2 

L[y]  =  -^+p(t)^-t+q(t)y=0  (2) 

are  known.  The  basic  principle  of  this  method  is  to  use  our  knowledge  of 
the  solutions  of  the  homogeneous  equation  to  help  us  find  a  solution  of  the 
nonhomogeneous  equation. 

Let  >>!(/)  and  y2(t)  be  two  linearly  independent  solutions  of  the  homoge¬ 
neous  equation  (2).  We  will  try  to  find  a  particular  solution  \p(t)  of  the 
nonhomogeneous  equation  (1)  of  the  form 

>K0  =  Ui(t)yM  +  U2(t)y2(t );  (3) 

that  is,  we  will  try  to  find  functions  ux(t)  and  u2(t)  so  that  the  linear  combi¬ 
nation  ux(t)yx(t)  +  u2(t)y2(t)  is  a  solution  of  (1).  At  first  glance,  this 
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would  appear  to  be  a  dumb  thing  to  do,  since  we  are  replacing  the  problem 
of  finding  one  unknown  function  xp(t)  by  the  seemingly  harder  problem  of 
finding  two  unknown  functions  ux(t)  and  u2(t).  However,  by  playing  our 
cards  right,  we  will  be  able  to  find  ux(t)  and  u2(t)  as  the  solutions  of  two 
very  simple  first-order  equations.  We  accomplish  this  in  the  following 
manner.  Observe  that  the  differential  equation  (1)  imposes  only  one  condi¬ 
tion  on  the  two  unknown  functions  ux(t )  and  u2(t).  Therefore,  we  have  a 
certain  “freedom”  in  choosing  ux(t)  and  u2(t).  Our  goal  is  to  impose  an 
additional  condition  on  ux(t)  and  u2(t)  which  will  make  the  expression 
L[uxyx  +  u2y2]  as  simple  as  possible.  Computing 

=  [uxy\  +  u2y'2]  +  [u\yx  +  u'2y2\ 

we  see  that  d 2\j// dt2,  and  consequently  L[i//],  will  contain  no  second-order 
derivatives  of  ux  and  u2  if 

yi(t)u'l{t)+y2{t)u'2(t)  =  0.  (4) 

This  suggests  that  we  impose  the  condition  (4)  on  the  functions  «,(/)  and 
u2(t).  In  this  case,  then, 

L[\p]  =  [u1y{  +  u2y'2]'+p(t)[u1y\  +  u2y'2]  +  q(t)[uiyl  +  u2y2] 

=  u\y\  +  u2y2+Ui[yi  +p(t)y\  +  q{t)yx  ]  +  u2[y2  +p{t)y2+ q{t)y2\ 
=  u\y\  +  u2y'2 

since  bothj^f)  andj>2(0  are  solutions  of  the  homogeneous  equation  L[y] 
=  0.  Consequently,  yp(t)-uxyx  4-  u2y2  is  a  solution  of  the  nonhomogeneous 
equation  (l)  if  ux(t)  and  u2(t)  satisfy  the  two  equations 

yi(t)u\(t)+y2(t)u'2(t)  =  0 
y'i(t)u'1(t)+y'2(t)u'2(t)  =  g(t). 


Multiplying  the  first  equation  by  y2(t),  the  second  equation  by  y2(t),  and 
subtracting  gives 

[y\  (OrtO)  ~y\  (0^2(0]«J  (0  =  -g(0*2(0> 

while  multiplying  the  first  equation  by /,(/),  the  second  equation  byy,(r), 
and  subtracting  gives 

bi(0>'2(0-/i(0>'2(0]«2(0=g(0M0- 

Hence, 


g(0*2(0  J  //N  g(0yi(0 

— : - ; —  and  uUt)= — - - — . 

w[y^2](0 


Finally,  we  obtain  ux{t)  and  u2(t)  by  integrating  the  right-hand  sides  of  (5). 
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Remark.  The  general  solution  of  the  homogeneous  equation  (2)  is 

By  letting  c,  and  c2  vary  with  time,  we  obtain  a  solution  of  the  nonhomo- 
geneous  equation.  Hence,  this  method  is  known  as  the  method  of  variation 
of  parameters. 


Example  1. 

(a)  Find  a  particular  solution  \p(t)  of  the  equation 


d2y 
dt 2 


+y  =  tan/ 


(6) 


on  the  interval  —  it / 2  <  /  <  */2. 

(b)  Find  the  solution  y  (/)  of  (6)  which  satisfies  the  initial  conditions  >>(0) 

1,/(0)=1. 


Solution . 

(a)  The  functions  >y1(/)  =  cos/  andy2(0  =  sin^  are  two  linearly  independent 
solutions  of  the  homogeneous  equation  y "  +y  =  0  with 


w  [  y  1^2  ]  ( 0  =y  i  y'i  -  y\  y2 = (cos  0cos  ‘  -  ( - sin  t  )sin  t  =  1 . 
Thus,  from  (5), 


u\  (t)=  -  tan t  sin t  and  u'2{t)  =  tan t  cos t.  (7) 

Integrating  the  first  equation  of  (7)  gives 


ux(t)=  —  J  ta,ntsintdt=  —  J 
cos2/  —  1 


sin2/ 


dt 


- 


cos  / 


cos  / 

dt  =  sin  /  —  ln|sec  /  +  tan  / 1. 


=  sin/  — ln(sec/  +  tan/), 

while  integrating  the  second  equation  of  (7)  gives 


U2(t)  =  j 


=  /  tan / cos t dt  =  J sintdt=  —  cost. 


Consequently, 

4'(t)  =  cos  /  [  sin  /  —  ln(sec  /  +  tan  /)  ]  +  sin  /  ( —  cos  /) 

=  —  cos  /  ln(sec  /  -I-  tan  /) 

is  a  particular  solution  of  (6)  on  the  interval  —m/2  <  /  <  */2. 

(b)  By  Theorem  5  of  Section  2.3, 

y{t)  =  CjCOS/4-  c2sin/  —  cos/ln(sec/  +  tan/) 

for  some  choice  of  constants  cl9  c2.  The  constants  cx  and  c2  are  determined 
from  the  initial  conditions 


1  =y  (0)  =  cx  and  \=y'(0)  =  c2— \ . 
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Hence,  c{  =  1,  c2  =  2  and 

y  ( r )  =  cos  t  +  2  sin  t  —  cos  t  ln(sec  1 4*  tan  t ) . 

Remark.  Equation  (5)  determines  ux(t)  and  u2(t)  up  to  two  constants  of  in¬ 
tegration.  We  usually  take  these  constants  to  be  zero,  since  the  effect  of 
choosing  nonzero  constants  is  to  add  a  solution  of  the  homogeneous  equa¬ 
tion  to  l//(t). 


Exercises 

Find  the  general  solution  of  each  of  the  following  equations. 

1  d2?  .  ,  7T  .  .  7T 

1.  — T+y  =  secr, 

dt2  1  1 


2. 


d2y  dy  „ 

— -  —  4  -7-  +  4 y  =  te2 
dt 2  *  7 


4.  ^-3^+2,-, e»+l 

dt 2  A  ^ 

Solve  each  of  the  following  initial-value  problems. 


5.  3/'+4/+y  =  (sin0e"';  y(0)  =  1,  y'(0)  =  0 

6.  /'  +  4/  +  4y  =  /5/V2';  y(0)=y'(0)  =  0 

7.  y"-3/  +  2y  =  V7+T  ;  y(0)=/(0)  =  0 


8.y"-y=/(0;  y(0)=y'(0)  =  0 


Warning.  It  must  be  remembered,  while  doing  Problems  3  and  5,  that 
Equation  (5)  was  derived  under  the  assumption  that  the  coefficient  of  y" 
was  one. 


9.  Find  two  linearly  independent  solutions  of  t2y"  —  2y  =  0  of  the  form  y(t)=  tr. 
Using  these  solutions,  find  the  general  solution  of  t2y"  —  2y  =  t2. 

10.  One  solution  of  the  equation 

y”+P(t)y'  +  q{t)y  =  Q  (*) 

is  (1  + 1)2,  and  the  Wronskian  of  any  two  solutions  of  (*)  is  constant.  Find  the 
general  solution  of 

y"  +p{t)y'  +  q(t)y  =  \  +  t. 


11.  Find  the  general  solution  of  y"  +  (l/4f2)y  =/cosf,  />0,  given  thaty1(/)=  V/ 
is  a  solution  of  the  homogeneous  equation. 
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12.  Find  the  general  solution  of  the  equation 

d2y  2t<ty  2  ,  ,  ,2 

13.  Show  that  sec  t  +  tan  t  is  positive  for  —  tt/2  <  t  <  it  jl. 


2.5  The  method  of  judicious  guessing 


A  serious  disadvantage  of  the  method  of  variation  of  parameters  is  that  the 
integrations  required  are  often  quite  difficult.  In  certain  cases,  it  is  usually 
much  simpler  to  guess  a  particular  solution.  In  this  section  we  will  establish 
a  systematic  method  for  guessing  solutions  of  the  equation 


a 


+  cy  =  g(t) 


where  a ,  b  and  c  are  constants,  and  g(t)  has  one  of  several  special  forms. 
Consider  first  the  differential  equation 


+  cy  =  a0+a1t+ ...  +antn* 


We  seek  a  function  ;//(/)  such  that  the  three  functions  a\p",  b$'  and  c\f/  add 
up  to  a  given  polynomial  of  degree  n.  The  obvious  choice  for  y^{t)  is  a 
polynomial  of  degree  n.  Thus,  we  set 


t//(/‘)  =  v4Q  +  y4|/  +  ...  +  Antn 


and  compute 


=  a[ 2A2  +...+«(«—  \)Antn  2^  +  b[Ax+  ...+nAntn 
+  c\_A0  +  Axt+  ...  A ntn~\^ 

=  cAntn  +  (cAn_x  +  nbAn)tn~l  +  ...  +(cA0+  bAl  +  2aA1) . 


Equating  coefficients  of  like  powers  of  t  in  the  equation 


gives 


L\\p  (/)  —  cIq  +  a^t  +  . . .  +  antn 


cAn  =  an,  cAn_x  +  nbAn  =  an_v...,cA0+  bAx  +  2aA2  =  a0. 


The  first  equation  determines  An  =  an/  c ,  for  0,  and  the  remaining  equa¬ 
tions  then  determine  An_v...,A0  successively.  Thus,  Equation  (1)  has  a 
particular  solution  \f/(t)  of  the  form  (3),  for  c^O. 

We  run  into  trouble  when  c  =  0,  since  then  the  first  equation  of  (4)  has 
no  solution  An.  This  difficulty  is  to  be  expected  though,  for  if  c  =  0,  then 
L[ty]  =  aty"  +  bxp'  is  a  polynomial  of  degree  n  —  1,  while  the  right  hand  side 
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of  (2)  is  a  polynomial  of  degree  n.  To  guarantee  that  a\p”  +  b\p'  is  a  poly¬ 
nomial  of  degree  n ,  we  must  take  ^  as  a  polynomial  of  degree  «  + 1.  Thus, 
we  set 

ip(t) =  /[^4q  +  A  xt  +  . . .  +  Antn^.  (5) 

We  have  omitted  the  constant  term  in  (5)  since  y  =  constant  is  a  solution  of 
the  homogeneous  equation  ay”  +  by' =  0,  and  thus  can  be  subtracted  from 
\[/(t).  The  coefficients  A0 ,  Ax,...,An  are  determined  uniquely  (see  Exercise 
19)  from  the  equation 

axp  ”  +  b\ y  =  a0  +  axt  +  ...  +  antn 


if  b^ 0. 

Finally,  the  case  b  —  c  =  0  is  trivial  to  handle  since  the  differential  equa¬ 
tion  (2)  can  then  be  integrated  immediately  to  yield  a  particular  solution 
of  the  form 


a0t2  a,f3  a„tn+2 

1’2  2-3  (n  +  l)(n  +  2) 


Summary.  The  differential  equation  (2)  has  a  solution  \p(t)  of  the  form 


Ag-h  A  jt  ~h  ...  4-  A  ntn , 
t  (y4o  +  Axt  +  ...  +  Antn), 
t2(A0  +  Axt  +  ...  +  A  ntn\ 


0 

c  =  0,  b^O . 
c  =  b  =  0 


Example  1.  Find  a  particular  solution  i(/(t)  of  the  equation 


L[y]  =  ^  +  ^7+y  =  '2- 


dt2  dt 
,2 


(6) 


Solution.  We  set  =  A0  +  Axt  +  A2t2  and  compute 

£[^](0  =  ^'X0  +  *//'(0  +  '//(0 

=  2A2  +  (A  x  +  2A2t)  +  Aq  +  A  xt  +  A2t 2 
=  (Tq  +  A  j  4-  2  A 2)  4-  (A  x  4-  2A2)t  4*  A  2t2 . 

Equating  coefficients  of  like  powers  of  t  in  the  equation  L[i//](/)  =  r  gives 

A2  =  Ax-\-2A2  —  0 

and 


-  A 


Aq  4-  A  |  4-  2  A  2 =  0. 

The  first  equation  tells  us  that  A2  =  1,  the  second  equation  then  tells  us  that 
Ax  =  —2,  and  the  third  equation  then  tells  us  that  A0  =  0.  Hence, 

( /)  =  —  2/+  t2 

is  a  particular  solution  of  (6). 
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Let  us  now  re-do  this  problem  using  the  method  of  variation  of  parame¬ 
ters.  It  is  easily  verified  that 

7,(0=  e  */2cosV3//2  and  T2(0==e  ^2sinVy//2 
are  two  solutions  of  the  homogeneous  equation  L[y]  —  0.  Hence, 

\l/(t)  =  ul(t)e~t/2c°sVJ  t/2+  u2(t)e~^2sin  V3  t/ 2 
is  a  particular  solution  of  (6),  where 


and 


-t2e  '/2sinV3//2 
W[yx,y2\{t)  d‘ 


J't2et/2sinVJ  t/2dt 


u2(t)  =  J 


t2e  ///2cosVy//2 

wW2](0 


dt  = 


-pr  t2et^2cos  V3  t/2dt 


These  integrations  are  extremely  difficult  to  perform.  Thus,  the  method  of 
guessing  is  certainly  preferrable,  in  this  problem  at  least,  to  the  method  of 
variation  of  parameters. 

Consider  now  the  differential  equation 

2 

L[y]  =  a^+b^  +  (y  =  (ao+a\t+ ■■■+a„tn)e°“.  (7) 

We  would  like  to  remove  the  factor  eat  from  the  right-hand  side  of  (7),  so 
as  to  reduce  this  equation  to  Equation  (2).  This  is  accomplished  by  setting 
y(t)  =  eatv(t).  Then, 


y'  =  eat  (i/  +  ao)  and  y"  =  eat(v"+  2av'  +  a2v) 


so  that 


L[  y~\  =  eat[av"  +  (2aa  +  b)v'  +  (aa2  +  ba  +  c)uj. 
Consequently,  y(t)  =  eatv(t)  is  a  solution  of  (7)  if,  and  only  if, 

a~7J  +(2aa  +  6)^—  +  (aa2  +  ba  +  c)v  =  a0  +  a^ + ...  (8) 

In  finding  a  particular  solution  v(t)  of  (8),  we  must  distinguish  as  to 
whether  (i)  aa2+ba  +  c¥z0;  (ii)  aa2  +  ba  +  c  =  0,  but  2aa  +  b=£0;  and  (iii) 
both  aa2+ba  +  c  and  2 aa  +  b  =  0.  The  first  case  means  that  a  is  not  a  root 
of  the  characteristic  equation 

ar 2  +  br  +  c  =  0.  (9) 

In  other  words,  eat  is  not  a  solution  of  the  homogeneous  equation  L[y]  =  0. 
The  second  condition  means  that  a  is  a  single  root  of  the  characteristic 
equation  (9).  This  implies  that  eat  is  a  solution  of  the  homogeneous  equa¬ 
tion,  but  teat  is  not.  Finally,  the  third  condition  means  that  a  is  a  double 
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root  of  the  characteristic  equation  (9),  so  that  both  eat  and  teat  are  solu¬ 
tions  of  the  homogeneous  equation.  Hence,  Equation  (7)  has  a  particular 
solution  xf/(t)  of  the  form  (i)  =  •••  +  Antn)eat,  if  eat  is  not  a  solu¬ 

tion  of  the  homogeneous  equation;  (ii)  ^(f)  —  t(AQ+  ...  +  Antn)ea\  if  eat  is 
a  solution  of  the  homogeneous  equation  but  teat  is  not;  and  (iii)  *K0  = 
t2(AQ  +  ...  +Antn)eat  if  both  eat  and  teat  are  solutions  of  the  homogeneous 
equation. 


Remark.  There  are  two  ways  of  computing  a  particular  solution  of  (7). 
Either  we  make  the  substitution y  =  eatv  and  find  v(t)  from  (8),  or  we  guess 
a  solution  \//(t)  of  the  form  eat  times  a  suitable  polynomial  in  t.  If  a  is  a 
double  root  of  the  characteristic  equation  (9),  or  if  n  >  2,  then  it  is  advis¬ 
able  to  set  y  =  eatv  and  then  find  v(t)  from  (8).  Otherwise,  we  guess  \p(t) 
directly. 


Example  2.  Find  the  general  solution  of  the  equation 

rt-44y+4y  =  (l  +  t+...+t27)e2'.  (10) 

dt2  dt 

Solution.  The  characteristic  equation  r2-4r  +  4  =  0  has  equal  roots  rx  =  r2 
=  2.  Hence,  yx(t)  =  e2t  and  y2(t)=te2t  are  solutions  of  the  homogeneous 
equation  >>"- 4/ +  4y  =  0.  To  find  a  particular  solution  i p{t)  of  (10),  we  set 
y  =  e2tv.  Then,  of  necessity, 

^=i +t+t2+...+t27. 

dt 2 


Integrating  this  equation  twice,  and  setting 
equal  to  zero  gives 


the  constants  of  integration 


28-29* 


Hence,  the  general  solution  of  (10)  is 


y(t)  =  cxe2t  +  c2ten  -I-  e 


It  i  „2t 


1-2 


+  . . .  4* 


t 


29 


28-29 


=  <?2, 


1 2  t29 
Ci  +  Cit  +  - — —  +  . . .  + 


1^1'2  1-2 . 28-29 

It  would  be  sheer  madness  (and  a  terrible  waste  of  paper)  to  plug  the  ex- 


pression 


v//(/)  =  f2(;40  +  Axt+  ...  +A21t21)e 
into  (10)  and  then  solve  for  the  coefficients  A  q,  A  j, . . .  ,A2j. 


Example  3.  Find  a  particular  solution  \p( t )  of  the  equation 


d2y  .dy 

dt 2  dt 


+  2y  =  {\  +  t)ev. 
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Solution .  In  this  case,  e3t  is  not  a  solution  of  the  homogeneous  equation 
y"  —  3y'4-2y  =  0.  Thus,  we  set  ip(t)  =  (A0+  Axt)e3t.  Computing 


L[\p](t)  =  \p"  —  3\l/'  +  2\p 

=  e3t[  (9Aq  4-  6  A  j  4-  9  Axt)  —  3(3  A  q  4-  A  x  4-  3  A  j/)  4-  2(A  0  4*  A  xt)  J 
=  e3t^  (2Aq  4-  3  A\)  4-  2  A  xt  J 

and  cancelling  off  the  factor  e3t  from  both  sides  of  the  equation 

L[^](t)  =  (\  +  t)e3\ 


gives 


2 Axt  4-  (2 Aq  4-  3 A  j)  =  1  4-  /. 

This  implies  that  2At  =  l  and  2A0  +  3Al  =  1.  Hence,  Ax  =  \,  A0=  —  \  and 


Finally,  we  consider  the  differential  equation 


d2y  dy 

a — r  +  6  —  +  cy  =  (a0+  axt  +  ...  +antn)X 


dv 


cosc ot 
.  sinco/ 


We  can  reduce  the  problem  of  finding  a  particular  solution  \f/(t)  of  (11)  to 
the  simpler  problem  of  finding  a  particular  solution  of  (7)  with  the  aid  of 
the  following  simple  but  extremely  useful  lemma. 


Lemma  1.  Let  y(t)=  u(t)  +  iv(t)  be  a  complex-valued  solution  of  the  equa¬ 
tion 


L  [  y  ] = a -77 + b % + cy = s  ( 0 = s  i  ( 0 + 2  ( 0 


(12) 


dt2  "  dt 

where  a ,  b  and  c  are  real.  This  means ,  of  course ,  that 

a[  u"(t)  4-  iv”{f)  ]  +  b[  u'(t)  4-  iv'(t)  ]  +  c[  u(t)  4-  iv(t)  ]  =  gx  (/)  4-  zg2(/). 

(13) 

Then ,  L[u](t)  =  gx(t)  and  L[v](t)  =  g2(t). 

Proof.  Equating  real  and  imaginary  parts  in  (13)  gives 

au"(t)  4-  bu'(t)  4-  cu(t)  =  gx  (t) 

and 

av"(t )  4-  bv'(t)  4-  cv(t)  =  g2(t).  □ 


Now,  let  <j>(t)  =  u(t)  +  iv(t)  be  a  particular  solution  of  the  equation 

d2y  dy 

a — —  4- +  cy  =  (aQ  +  ...  4 - antn>)el03t . 


dt 


(14) 


The  real  part  of  the  right-hand  side  of  (14)  is  (a0+  ...  +  antn) cos ut,  while 
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the  imaginary  part  is  (tf0+  ...  +  antn)sina)t.  Hence,  by  Lemma  1 

M(/)  =  Re{<(>(/)} 

is  a  solution  of 


ay"  +  by'  +  cy  =  (a0  +  ...  +  antn)coso>t 

while 

o(i)  =  Im{<|>(r)} 

is  a  solution  of 

ay"  +  by'  +  cy  =  (a0  +  ...  +  antn)sincot. 


Example  4.  Find  a  particular  solution  \f/(t)  of  the  equation 

L[y]  =  ^+4y  =  sin2t.  (15) 

Solution .  We  will  find  i p(t)  as  the  imaginary  part  of  a  complex-valued  solu¬ 
tion  <#>(/)  of  the  equation 

I'[y]  =  ^+4y  =  e2“-  (16) 

To  this  end,  observe  that  the  characteristic  equation  r2  +  4  =  0  has  complex 
roots  r—  ±2 i.  Therefore,  Equation  (16)  has  a  particular  solution  </>(/)  of  the 
form  </>(/)  =*A0te2it.  Computing 

<f>'(t)  =  A0{\  +  2  it)e2lt  and  </>"(/)  =  A0(4i  ~  )e2lt 

we  see  that 

L[<p](t)  =  <p”{t)  +  4<p{t)  =  4iAtf2it . 

Hence,  A0=  1/4/=  -  i/4  and 

<t>(t)=  -  ^e2lt=  ~  ^(cos2/  +  /sin2/)=  ^  sin2/  —  /^-  cos2/. 

Therefore,  ^(/)  =  Im(<#>(/))  =  — (//4)cos2/  is  a  particular  solution  of  (15). 

Example  5.  Find  a  particular  solution  \p(t)  of  the  equation 

d2v 

— i-  +4y  =  cos2/.  (17) 

dt2 

Solution .  From  Example  4,  </>(/)  =  (//4)sin2/  — /(//4)cos2/  is  a  complex¬ 
valued  solution  of  (16).  Therefore, 

i|/(?)  =  Re(<(>(0}  =  ^  sin2f 

is  a  particular  solution  of  (17). 
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Example  6.  Find  a  particular  solution  \(/(t)  of  the  equation 

L[y]  =  'dti+2<ji+y  =  te‘COSt'  (18) 

Solution .  Observe  that  te* cost  is  the  real  part  of  Therefore,  we  can 

find  \f/(t)  as  the  real  part  of  a  complex-valued  solution  <j>(t)  of  the  equation 

Liy]=^+2^+y=te(l+i)i-  (i9> 

To  this  end,  observe  that  1  +  /  is  not  a  root  of  the  characteristic  equation 
r2  +  2r+  1  =0.  Therefore,  Equation  (19)  has  a  particular  solution  <#>(/)  of  the 
form  <j>(t)  =  (A0-\-Alt)e^  +  l)t.  Computing  L[</>]  =  <#>"  +  2</>'  +  </>,  and  using  the 
identity 

( i  +  02 + 2(  i  +  o + 1  =  [  ( i  +  0 + 1  ]2 = (2 + o2 

we  see  that 

[  (2  +  /)^ A  ^t  +  (2  +  Aq  +  2(2  +  i)A  j  j  =  /. 

Equating  coefficients  of  like  powers  of  t  in  this  equation  gives 

(2  +  if  A ,  =  1 

and 

(2  4-  i)Aq  +  2 A  j  =  0. 

This  implies  that  Al  =  1  /(2  +  i)2  and  A0=  -2/(2  + 1)3,  so  that 


<ho= 

L(2  +  0‘ 

After  a  little  algebra,  we  find  that 


-2 


+ 


t 


(2  +  if 


,0  +  i)t 


</>(/) =  { [(15/  — 4)cos/  +  (20/-22)sin/] 


+  /  (22  —  20/)cos/  +  (15/  —  4)sin/] }. 


Hence, 

^(/)  =  Re{</>(/)}  =  [(15/-4)cos/  +  (20/-22)sin/]. 

Remark.  The  method  of  judicious  guessing  also  applies  to  the  equation 

L[y]=a^ +b<ii+cy=  2  Pjif)^  (20) 

where  the  pj(t)J=\9...,n  are  polynomials  in  /.  Let  \[/.(t)  be  a  particular 
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solution  of  the  equation 

L[y]=Pj(t)eC‘J^  j=l,...,n. 

Then,  v|/(0  =  2”=1^(0  is  a  solution  of  (20)  since 

L[t]-L  ttj  =  2  4^]  =  i 

j=\  J  7=1  7=1 

Thus,  to  find  a  particular  solution  of  the  equation 

y"+y'+y  =  e'  +  /sin/ 

we  find  particular  solutions  t/q(/)  and  \p2(0  of  the  equations 

y"+y'+y  =  el  and  y"+y'+y  =  /sin/ 
respectively,  and  then  add  these  two  solutions  together. 

Exercises 

Find  a  particular  solution  of  each  of  the  following  equations. 

1.  y"  +  3y  =  /3  —  1  2.  y"  +  4y'  +  4y  =  teat 

3.  y"  ~y  —  t2e*  4.  y"  +y'+y  =  1  + /  + 12 

5.  y"  +  2y'+y  =  e-'  6.  y"  +  5y'  +  4y  =  /V' 

7.  y"  +  4y  =  /sin2/  8.  y" -6y'  +  9y  =  (3/7-5/4)e3' 

9.  y"-2y'  +  5y  =  2 cos2/  10.  y"  —  2y'  +  5y  =  2(cos2t)et 

11.  y"  +y'  —  6y  =  sin/  +  te2t  12.  y"  +y'  +  4y  =  /2  +  (2/  +  3)(1  4- cos/) 

13.  y"-3y'  +  2y  =  e'  +  e2'  14.  y"  +  2y'  =  1  + /2  +  e"2' 

15.  y" +y  =  cos/cos2/  16.  y"  +y  =  cos /cos 2/ cos 3/. 

17.  (a)  Show  that  cos3<o/  =  ^Re{e3“°'  +  3e/<‘5'}. 

Hint :  cos<d/  =  (e,i0'  +  e~i0)t)/2. 

(b)  Find  a  particular  solution  of  the  equation 

lOy  "  +  0.2y'  +  lOOOy  =  5  +  20cos3 10/ 

18.  (a)  Let  L[y]=y"  —  2rxyf  +  r2y.  Show  that 

L[er'tv(t)]  =  er'tv"(t). 

(b)  Find  the  general  solution  of  the  equation 

y"-6y'  +  9y  =  /3/2e3'. 

19.  Let  \p(t)  =  t(A0  +  ...  +Antn),  and  assume  that  b^=0.  Show  that  the  equation 
a\f/"  +  b\p'  =  a0+  ...  +  antn  determines  A0,...,An  uniquely. 
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2.6  Mechanical  vibrations 

Consider  the  case  where  a  small  object  of  mass  m  is  attached  to  an  elastic 
spring  of  length  /,  which  is  suspended  from  a  rigid  horizontal  support  (see 
Figure  1).  (An  elastic  spring  has  the  property  that  if  it  is  stretched  or  com¬ 
pressed  a  distance  A /  which  is  small  compared  to  its  natural  length  /,  then 
it  will  exert  a  restoring  force  of  magnitude  &A/.  The  constant  k  is  called 
the  spring-constant,  and  is  a  measure  of  the  stiffness  of  the  spring.)  In 
addition,  the  mass  and  spring  may  be  immersed  in  a  medium,  such  as  oil, 
which  impedes  the  motion  of  an  object  through  it.  Engineers  usually  refer 
to  such  systems  as  spring-mass-dashpot  systems,  or  as  seismic  instruments, 
since  they  are  similar,  in  principle,  to  a  seismograph  which  is  used  to  detect 
motions  of  the  earth’s  surface. 


Figure  1 


Spring-mass-dashpot  systems  have  many  diverse  applications.  For  ex¬ 
ample,  the  shock  absorbers  in  our  automobiles  are  simple  spring-mass- 
dashpot  systems.  Also,  most  heavy  gun  emplacements  are  attached  to  such 
systems  so  as  to  minimize  the  “recoil”  effect  of  the  gun.  The  usefulness  of 
these  devices  will  become  apparent  after  we  set  up  and  solve  the  differen¬ 
tial  equation  of  motion  of  the  mass  m. 

In  calculating  the  motion  of  the  mass  m,  it  will  be  convenient  for  us  to 
measure  distances  from  the  equilibrium  position  of  the  mass,  rather  than 
the  horizontal  support.  The  equilibrium  position  of  the  mass  is  that  point 
where  the  mass  will  hang  at  rest  if  no  external  forces  act  upon  it.  In 
equilibrium,  the  weight  mg  of  the  mass  is  exactly  balanced  by  the  restoring 
force  of  the  spring.  Thus,  in  its  equilibrium  position,  the  spring  has  been 
stretched  a  distance  A/,  where  kM=mg.  We  lety  =  0  denote  this  equi¬ 
librium  position,  and  we  take  the  downward  direction  as  positive.  Lety(/) 
denote  the  position  of  the  mass  at  time  t.  To  findy(/),  we  must  compute 
the  total  force  acting  on  the  mass  m.  This  force  is  the  sum  of  four  separate 
forces  W,  R ,  D  and  F. 
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(i)  The  force  W  =  mg  is  the  weight  of  the  mass  pulling  it  downward. 
This  force  is  positive,  since  the  downward  direction  is  the  positive  y  direc¬ 
tion. 

(ii)  The  force  R  is  the  restoring  force  of  the  spring,  and  it  is  proportional 
to  the  elongation,  or  compression,  A l+y  of  the  spring.  It  always  acts  to  re¬ 
store  the  spring  to  its  natural  length.  If  A/+y>0,  then  R  is  negative,  so 
that  R=  —  &(A/  +  y),  and  if  A/  +  y  <0,  then  R  is  positive,  so  that  R  = 
—  &(A/+y).  In  either  case, 

R  =  —  k(Al +y). 

(iii)  The  force  D  is  the  damping,  or  drag  force,  which  the  medium  exerts 
on  the  mass  m.  (Most  media,  such  as  oil  and  air,  tend  to  resist  the  motion 
of  an  object  through  it.)  This  force  always  acts  in  the  direction  opposite  the 
direction  of  motion,  and  is  usually  directly  proportional  to  the  magnitude 
of  the  velocity  dy / dt.  If  the  velocity  is  positive;  that  is,  the  mass  is  moving 
in  the  downward  direction,  then  D—  —  cdy  /  dt,  and  if  the  velocity  is  nega¬ 
tive,  then  D=  —  cdy/dt.  In  either  case, 

D  —  —cdy/dt. 

(iv)  The  force  F  is  the  external  force  applied  to  the  mass.  This  force  is 
directed  upward  or  downward,  depending  as  to  whether  F  is  positive  or 
negative.  In  general,  this  external  force  will  depend  explicitly  on  time. 

From  Newton’s  second  law  of  motion  (see  Section  1.7) 

d2y 

m — 7  =  W+  R  +  D  +  F 

dt2 

=  mg-k(  A/  +y)~c^  +  F(t) 

=  -ky-C^+m, 

since  mg  =  kkl.  Hence,  the  position  y(t)  of  the  mass  satisfies  the  second- 
order  linear  differential  equation 

m^+c^+ky=F{t)  (1) 
where  m,  c  and  k  are  nonnegative  constants.  We  adopt  here  the  mks  system 
of  units  so  that  F  is  measured  in  newtons,  y  is  measured  in  meters,  and  t  is 
measured  in  seconds.  In  this  case,  the  units  of  k  are  N /m,  the  units  of  c  are 
N  •  s/m,  and  the  units  of  m  are  kilograms  (N  •  s2/m) 


(a)  Free  vibrations: 

We  consider  first  the  simplest  case  of  free  undamped  motion.  In  this  case, 
Equation  (1)  reduces  to 


d2y  ,  n  d2y 

m — —  +  ky  =  0  or  — - 

dt 2  dt 2 


+  <OoJ>  =  0 


166 


2.6  Mechanical  vibrations 


where  c o\  —  k/m.  The  general  solution  of  (2)  is 

y  ( /)  =  a  cos  co0t  +  b  sin  co0/.  (3) 

In  order  to  analyze  the  solution  (3),  it  is  convenient  to  rewrite  it  as  a  single 
cosine  function.  This  is  accomplished  by  means  of  the  following  lemma. 

Lemma  1.  Any  function  y{t)  of  the  form  (3)  can  be  written  in  the  simpler 
form 

y{t)  =  Rcos(co0t  —  S)  (4) 

where  /?  =  V a2 -f  b2  and  S  =  tan-1fc/a. 

Proof.  We  will  verify  that  the  two  expressions  (3)  and  (4)  are  equal.  To 
this  end,  compute 

R  cos  (u0t  —  8)  =  R  cos  c o0t  cos  8  +  R  sin  c o0t  sin  8 
and  observe  from  Figure  2  that  Rcos8  =  a  and  Rsin8  =  b.  Hence, 

R  cos(c o0t  —  8)  =  a  cos  o)0t  +  b  sin  u0t.  □ 


b 


a 

Figure  2 

In  Figure  3  we  have  graphed  the  function  y  =  R  cos (co0/  -  6).  Notice  that 
y  (t)  always  lies  between  —  R  and  +  R ,  and  that  the  motion  of  the  mass  is 
periodic — it  repeats  itself  over  every  time  interval  of  length  27r/co0.  This 
type  of  motion  is  called  simple  harmonic  motion;  R  is  called  the  amplitude 
of  the  motion,  8  the  phase  angle  of  the  motion,  T0  =  27r/o)0  the  natural 

period  of  the  motion,  and  <o0=  vk/m  the  natural  frequency  of  the  sys¬ 
tem. 


(b)  Damped  free  vibrations : 

If  we  now  include  the  effect  of  damping,  then  the  differential  equation 
governing  the  motion  of  the  mass  is 

d2y 
m — - 
dt2 

The  roots  of  the  characteristic  equation  mr2  +  cr  +  k  =  0  are 


ay 

4-  c  — — h  ky  =  0. 
dt 


(5) 


r\  = 


c  ~h  Vc2  —  4km 
2m 


and 


>2  = 


—  c  —  V c2  —  4km 
2m 
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Thus,  there  are  three  cases  to  consider,  depending  as  to  whether  c2  —  4km 
is  positive,  negative  or  zero. 

(i)  c 2  -  4 km  >  0.  In  this  case  both  rx  and  r2  are  negative,  and  every  solu¬ 
tion  y(t)  of  (5)  has  the  form 

y(t):s:aerit  +  ber2t. 

(ii)  c2  —  4km  =  0.  In  this  case,  every  solution  y(t)  of  (5)  is  of  the  form 

y(t)  =  (a  4-  bt)e~ct/2m. 

(iii)  c2-4km<0.  In  this  case,  every  solution y(t)  of  (5)  is  of  the  form 

>>(/)  =  e“c'/2m[0COSju/  +  Z>sin/ji/],  ji—  — ° — ’ 

The  first  two  cases  are  referred  to  as  overdamped  and  critically  damped, 
respectively.  They  represent  motions  in  which  the  originally  displaced  mass 
creeps  back  to  its  equilibrium  position.  Depending  on  the  initial  condi¬ 
tions,  it  may  be  possible  to  overshoot  the  equilibrium  position  once,  but  no 
more  than  once  (see  Exercises  2-3).  The  third  case,  which  is  referred  to  as 
an  under  damped  motion,  occurs  quite  often  in  mechanical  systems  and 
represents  a  damped  vibration.  To  see  this,  we  use  Lemma  1  to  rewrite  the 
function 

y(t)  =  e~ ctf2m [  a  cos +  b  sin  ju ,t ] 

in  the  form 

y(t)  =  Re~ ct/2m  cos(  [it  — S). 

The  displacement  y  oscillates  between  the  curves ±Re~ct^lm,  and  thus 
represents  a  cosine  curve  with  decreasing  amplitude,  as  shown  in  Figure  4. 

Now,  observe  that  the  motion  of  the  mass  always  dies  out  eventually  if 
there  is  damping  in  the  system.  In  other  words,  any  initial  disturbance  of 
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the  system  is  dissipated  by  the  damping  present  in  the  system.  This  is  one 
reason  why  spring-mass-dashpot  systems  are  so  useful  in  mechanical  sys¬ 
tems:  they  can  be  used  to  damp  out  any  undesirable  disturbances.  For  ex¬ 
ample,  the  shock  transmitted  to  an  automobile  by  a  bump  in  the  road  is 
dissipated  by  the  shock  absorbers  in  the  car,  and  the  momentum  from  the 
recoil  of  a  gun  barrel  is  dissipated  by  a  spring-mass-dashpot  system 
attached  to  the  gun. 

(c)  Damped  forced  vibrations : 

If  we  now  introduce  an  external  force  F(t)  =  F0cosoit ,  then  the  differential 
equation  governing  the  motion  of  the  mass  is 

d2y 
m — - 
dt2 

Using  the  method  of  judicious  guessing,  we  can  find  a  particular  solution 
ip(t)  of  (6)  of  the  form 


dy 

+  c~j‘  +  ky  =  F0coso)t. 


(6) 


m= 


o 


2, .2 


(k  —  mo)  )  4*  c  co 


[(/c—  mco2)coscot  +  coosinco/J 


o 


2,  J2  L 


>2\2  i  „2..2nl/2 


(k  —  mo)  )  +c o) 
F0cos(cot-8) 

— mco2)2  + c2co2  j 


( k  —  mcoz )  4“  c  iv  cos (<o/  — 5) 


(7) 


where  tan  5  =  c<o/( k  —  mco2).  Hence,  every  solution  y(t)  of  (6)  must  be  of 
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the  form 

Fncos(co/  —  S ) 

y  (t)  =  <j>(()  +  +(t)- =  4>(t)  + - ^ (8) 

(k  —  mco2)  +  c2co2J 

where  </>(/)  is  a  solution  of  the  homogeneous  equation 

m£i+c±+l9,,0.  (9) 

dr  dt  ' 

We  have  already  seen  though,  that  every  solution;'  =  <K0  of  (9)  approaches 
zero  as  t  approaches  infinity.  Thus,  for  large  /,  the  equation  y(t)  =  \p(t)  de¬ 
scribes  very  accurately  the  position  of  the  mass  m,  regardless  of  its  initial 
position  and  velocity.  For  this  reason,  is  called  the  steady  state  part  of 
the  solution  (8),  while  <f>(t)  is  called  the  transient  part  of  the  solution. 


(d)  Forced  free  vibrations : 

We  now  remove  the  damping  from  our  system  and  consider  the  case  of 
forced  free  vibrations  where  the  forcing  term  is  periodic  and  has  the  form 
F(t)  —  F0  cos  cot.  In  this  case,  the  differential  equation  governing  the  motion 
of  the  mass  m  is 


2  F0 

+  <Oq  y  =  —  coso)/, 
m 


The  case  cot^co0  is  uninteresting;  every  solution  y(t)  of  (10)  has  the  form 


y  (/)  =  cl  cos co0t  +  c2  sin oi0t  + 


o 


m(coo~to2) 


cos  co/, 


and  thus  is  the  sum  of  two  periodic  functions  of  different  periods.  The  in¬ 
teresting  case  is  when  co  =  co0;  that  is,  when  the  frequency  co  of  the  external 
force  equals  the  natural  frequency  of  the  system.  This  case  is  called  the  res¬ 
onance  case,  and  the  differential  equation  of  motion  for  the  mass  m  is 


^0 

- COSCOn/. 

m 


We  will  find  a  particular  solution  \p(t)  of  (11)  as  the  real  part  of  a 
complex-valued  solution  <£(/)  of  the  equation 


Since  elUot  is  a  solution  of  the  homogeneous  equation  y"  +  coq;>  =  0,  we 
know  that  (12)  has  a  particular  solution  §(t)  =  Atem°\  for  some  constant  A. 
Computing 

</>"  +  coq</>  =  2/c o0AelU3ot 
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we  see  that 


Hence, 


A  = 


1 


o 


2/co0  m 


zifi 

2  mo)0  ' 


<ko= 


-  iFo> 


2  mu 

F0t 


(cos  o)Qt  +  i  sin  w0/) 


0 


sin  u0t  - 1  - 

2wi(0q  2/77  co 


COS  0)0t 


0 


is  a  particular  solution  of  (12),  and 


F0t 


iH0  =  Re{<K0}  =  2^7  sinWot 


is  a  particular  solution  of  (11).  Consequently,  every  solution  y(t)  of  (11)  is 
of  the  form 


y(/)  =  c1cos<o0f  +  c2sinco0/  + 


F 0t 
2mo)0 


sin  co0t 


for  some  choice  of  constants  cvc2. 

Now,  the  sum  of  the  first  two  terms  in  (13)  is  a  periodic  function  of 
time.  The  third  term,  though,  represents  an  oscillation  with  increasing  am¬ 
plitude,  as  shown  in  Figure  5.  Thus,  the  forcing  term  F0cos(o/,  if  it  is  in  res¬ 
onance  with  the  natural  frequency  of  the  system,  will  always  cause  un¬ 
bounded  oscillations.  Such  a  phenomenon  was  responsible  for  the  collapse 
of  the  Tacoma  Bridge,  (see  Section  2.6.1)  and  many  other  mechanical 
catastrophes. 


y 
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Exercises 


1.  It  is  found  experimentally  that  a  1  kg  mass  stretches  a  spring  49/320  m.  If  the 
mass  is  pulled  down  an  additional  1  /4  m  and  released,  find  the  amplitude, 
period  and  frequency  of  the  resulting  motion,  neglecting  air  resistance  (use 
g  =  9.8  m/s2). 


2.  Lei  y(t)  =  Aerxt  +  Ber2\  with  \A  \  + 12?|^0. 

(a)  Show  that7(/)  is  zero  at  most  once. 

(b)  Show  that  y\t)  is  zero  at  most  once. 


3.  Let y(t)  =  {A  +  Bt)ert,  with  \A\  +  |2?| =^0. 

(a)  Show  that  >>(0  is  zero  at  most  once. 

(b)  Show  that  >>'(/)  is  zero  at  most  once. 


4.  A  small  object  of  mass  1  kg  is  attached  to  a  spring  with  spring  constant  2N/m. 
This  spring-mass  system  is  immersed  in  a  viscous  medium  with  damping 
constant  3  N-s/m.  At  time  t  —  0,  the  mass  is  lowered  1/2  m  below  its 
equilibrium  position,  and  released.  Show  that  the  mass  will  creep  back  to  its 
equilibrium  position  as  t  approaches  infinity. 


5.  A  small  object  of  mass  1  kg  is  attached  to  a  spring  with  spring-constant  1  N/m 
and  is  immersed  in  a  viscous  medium  with  damping  constant  2  N  •  s/m.  At  time 
t  —  0,  the  mass  is  lowered  1  /4  m  and  given  an  initial  velocity  of  1  m/s  in  the 
upward  direction.  Show  that  the  mass  will  overshoot  its  equilibrium  position 
once,  and  then  creep  back  to  equilibrium. 

6.  A  small  object  of  mass  4  kg  is  attached  to  an  elastic  spring  with  spring-constant 
64  N /m,  and  is  acted  upon  by  an  external  force  F(t)  =  A  cos3cot.  Find  all  values 
of  a?  at  which  resonance  occurs. 


7.  The  gun  of  a  U.S.  M60  tank  is  attached  to  a  spring- mass- dashpot  system  with 
spring-constant  100a2  and  damping  constant  200a,  in  their  appropriate  units. 
The  mass  of  the  gun  is  100  kg.  Assume  that  the  displacement  y(t)  of  the  gun 
from  its  rest  position  after  being  fired  at  time  t  =  0  satisfies  the  initial- value 
problem 

100y"  +  200a/  +  100a2y  =  0;  y(0)  =  0,  /(0)  =  100  m/s. 

It  is  desired  that  one  second  later,  the  quantity  y 2  +  (y')2  t>e  less  than  -01-  How 
large  must  a  be  to  guarantee  that  this  is  so?  (The  spring-mass-dashpot 
mechanism  in  the  M60  tanks  supplied  by  the  U.S.  to  Israel  are  critically 
damped,  for  this  situation  is  preferable  in  desert  warfare  where  one  has  to  fire 
again  as  quickly  as  possible). 

8.  A  spring-mass-dashpot  system  has  the  property  that  the  spring  constant  k  is  9 
times  its  mass  m ,  and  the  damping  constant  c  is  6  times  its  mass.  At  time  t  —  0, 
the  mass,  which  is  hanging  at  rest,  is  acted  upon  by  an  external  force  F(t)  = 
(3  sin3r)  N.  The  spring  will  break  if  it  is  stretched  an  additional  5  m  from  its 
equilibrium  position.  Show  that  the  spring  will  not  break  if  m  >  1  /5  kg. 

9.  A  spring-mass-dashpot  system  with  m  —  1,  k  —  2  and  c  —  2  (in  their  respective 
units)  hangs  in  equilibrium.  At  time  t  —  0,  an  external  force  F(t)  =  7r  —  f  N  acts 
for  a  time  interval  tt.  Find  the  position  of  the  mass  at  anytime  t  >  tt. 
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10.  A  1  kg  mass  is  attached  to  a  spring  with  spring  constant  k  —  64  N /m.  With  the 
mass  on  the  spring  at  rest  in  the  equilibrium  position  at  time  t  —  0,  an  external 
force  F(t)  =  (^t)  N  is  applied  until  time  tx  —  777/16  seconds,  at  which  time  it  is 
removed.  Assuming  no  damping,  find  the  frequency  and  amplitude  of  the 
resulting  oscillation. 

11.  A  1  kg  mass  is  attached  to  a  spring  with  spring  constant  k  =  4  N/m,  and  hangs 
in  equilibrium.  An  external  force  F(t)  =  (1  +  t  -\-sin2t)  N  is  applied  to  the  mass 
beginning  at  time  t  =  0.  If  the  spring  is  stretched  a  length  (1/2+7t/4)  m  or 
more  from  its  equilibrium  position,  then  it  will  break.  Assuming  no  damping 
present,  find  the  time  at  which  the  spring  breaks. 

12.  A  small  object  of  mass  l  kg  is  attached  to  a  spring  with  spring  constant  k  —  1 
N/m.  This  spring-mass  system  is  then  immersed  in  a  viscous  medium  with 
damping  constant  c.  An  external  force  F(t)  =  (3  —  cos  t)  N  is  applied  to  the 
system.  Determine  the  minimum  positive  value  of  c  so  that  the  magnitude  of  the 
steady  state  solution  does  not  exceed  5  m. 

13.  Determine  a  particular  solution  \p(t)  of  my"  +  cy'  +  ky  =  F0cosco/,  of  the  form 
\p(t)  =  A  cos(o)t  —  <f>).  Show  that  the  amplitude  A  is  a  maximum  when  oj2  =  (Oq 
—  \{c /  m)2.  This  value  of  <o  is  called  the  resonant  frequency  of  the  system.  What 
happens  when  coq<  \{c/ m)2ct 


2.6.1  The  Tacoma  Bridge  disaster 

On  July  1,  1940,  the  Tacoma  Narrows  Bridge  at  Puget  Sound  in  the  state 
of  Washington  was  completed  and  opened  to  traffic.  From  the  day  of  its 
opening  the  bridge  began  undergoing  vertical  oscillations,  and  it  soon  was 
nicknamed  “Galloping  Gertie.”  Strange  as  it  may  seem,  traffic  on  the 
bridge  increased  tremendously  as  a  result  of  its  novel  behavior.  People 
came  from  hundreds  of  miles  in  their  cars  to  enjoy  the  curious  thrill  of 
riding  over  a  galloping,  rolling  bridge.  For  four  months,  the  bridge  did  a 
thriving  business.  As  each  day  passed,  the  authorities  in  charge  became 
more  and  more  confident  of  the  safety  of  the  bridge — so  much  so,  in  fact, 
that  they  were  planning  to  cancel  the  insurance  policy  on  the  bridge. 

Starting  at  about  7:00  on  the  morning  of  November  7,  1940,  the  bridge 
began  undulating  persistently  for  three  hours.  Segments  of  the  span  were 
heaving  periodically  up  and  down  as  much  as  three  feet.  At  about  10:00 
a.m.,  something  seemed  to  snap  and  the  bridge  began  oscillating  wildly.  At 
one  moment,  one  edge  of  the  roadway  was  twenty-eight  feet  higher  than 
the  other;  the  next  moment  it  was  twenty-eight  feet  lower  than  the  other 
edge.  At  10:30  a.m.  the  bridge  began  cracking,  and  finally,  at  11:10  a.m. 
the  entire  bridge  came  crashing  down.  Fortunately,  only  one  car  was  on 
the  bridge  at  the  time  of  its  failure.  It  belonged  to  a  newspaper  reporter 
who  had  to  abandon  the  car  and  its  sole  remaining  occupant,  a  pet  dog, 
when  the  bridge  began  its  violent  twisting  motion.  The  reporter  reached 
safety,  torn  and  bleeding,  by  crawling  on  hands  and  knees,  desperately 
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clutching  the  curb  of  the  bridge.  His  dog  went  down  with  the  car  and  the 
span — the  only  life  lost  in  the  disaster. 

There  were  many  humorous  and  ironic  incidents  associated  with  the  col¬ 
lapse  of  the  Tacoma  Bridge.  When  the  bridge  began  heaving  violently,  the 
authorities  notified  Professor  F.  B.  Farquharson  of  the  University  of 
Washington.  Professor  Farquharson  had  conducted  numerous  tests  on  a 
simulated  model  of  the  bridge  and  had  assured  everyone  of  its  stability. 
The  professor  was  the  last  man  on  the  bridge.  Even  when  the  span  was  tilt¬ 
ing  more  than  twenty-eight  feet  up  and  down,  he  was  making  scientific  ob¬ 
servations  with  little  or  no  anticipation  of  the  imminent  collapse  of  the 
bridge.  When  the  motion  increased  in  violence,  he  made  his  way  to  safety 
by  scientifically  following  the  yellow  line  in  the  middle  of  the  roadway. 
The  professor  was  one  of  the  most  surprised  men  when  the  span  crashed 
into  the  water. 

One  of  the  insurance  policies  covering  the  bridge  had  been  written  by  a 
local  travel  agent  who  had  pocketed  the  premium  and  had  neglected  to  re¬ 
port  the  policy,  in  the  amount  of  $800,000,  to  his  company.  When  he  later 
received  his  prison  sentence,  he  ironically  pointed  out  that  his  embezzle¬ 
ment  would  never  have  been  discovered  if  the  bridge  had  only  remained 
up  for  another  week,  at  which  time  the  bridge  officials  had  planned  to 
cancel  all  of  the  policies. 

A  large  sign  near  the  bridge  approach  advertised  a  local  bank  with  the 
slogan  “as  safe  as  the  Tacoma  Bridge.”  Immediately  following  the  collapse 
of  the  bridge,  several  representatives  of  the  bank  rushed  out  to  remove  the 
billboard. 

After  the  collapse  of  the  Tacoma  Bridge,  the  governor  of  the  state  of 
Washington  made  an  emotional  speech,  in  which  he  declared  “We  are 
going  to  build  the  exact  same  bridge,  exactly  as  before.”  Upon  hearing 
this,  the  noted  engineer  Von  Karman  sent  a  telegram  to  the  governor  stat¬ 
ing  “If  you  build  the  exact  same  bridge  exactly  as  before,  it  will  fall  into 
the  exact  same  river  exactly  as  before.” 

The  collapse  of  the  Tacoma  Bridge  was  due  to  an  aerodynamical  phe¬ 
nomenon  known  as  stall  flutter.  This  can  be  explained  very  briefly  in  the 
following  manner.  If  there  is  an  obstacle  in  a  stream  of  air,  or  liquid,  then 
a  “vortex  street”  is  formed  behind  the  obstacle,  with  the  vortices  flowing 
off  at  a  definite  periodicity,  which  depends  on  the  shape  and  dimension  of 
the  structure  as  well  as  on  the  velocity  of  the  stream  (see  Figure  1).  As  a 
result  of  the  vortices  separating  alternately  from  either  side  of  the  obstacle, 
it  is  acted  upon  by  a  periodic  force  perpendicular  to  the  direction  of  the 
stream,  and  of  magnitude  F0cosatf.  The  coefficient  F0  depends  on  the 
shape  of  the  structure.  The  poorer  the  streamlining  of  the  structure;  the 
larger  the  coefficient  and  hence  the  amplitude  of  the  force.  For  exam¬ 
ple,  flow  around  an  airplane  wing  at  small  angles  of  attack  is  very  smooth, 
so  that  the  vortex  street  is  not  well  defined  and  the  coefficient  F0  is  very 
small.  The  poorly  streamlined  structure  of  a  suspension  bridge  is  another 
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matter,  and  it  is  natural  to  expect  that  a  force  of  large  amplitude  will  be  set 
up.  Thus,  a  structure  suspended  in  an  air  stream  experiences  the  effect  of 
this  force  and  hence  goes  into  a  state  of  forced  vibrations.  The  amount  of 
danger  from  this  type  of  motion  depends  on  how  close  the  natural 
frequency  of  the  structure  (remember  that  bridges  are  made  of  steel,  a 
highly  elastic  material)  is  to  the  frequency  of  the  driving  force.  If  the  two 
frequencies  are  the  same,  resonance  occurs,  and  the  oscillations  will  be  de¬ 
structive  if  the  system  does  not  have  a  sufficient  amount  of  damping.  It  has 
now  been  established  that  oscillations  of  this  type  were  responsible  for  the 
collapse  of  the  Tacoma  Bridge.  In  addition,  resonances  produced  by  the 
separation  of  vortices  have  been  observed  in  steel  factory  chimneys,  and  in 
the  periscopes  of  submarines. 

The  phenomenon  of  resonance  was  also  responsible  for  the  collapse  of 
the  Broughton  suspension  bridge  near  Manchester,  England  in  1831.  This 
occurred  when  a  column  of  soldiers  marched  in  cadence  over  the  bridge, 
thereby  setting  up  a  periodic  force  of  rather  large  amplitude.  The 
frequency  of  this  force  was  equal  to  the  natural  frequency  of  the  bridge. 
Thus,  very  large  oscillations  were  induced,  and  the  bridge  collapsed.  It  is 
for  this  reason  that  soldiers  are  ordered  to  break  cadence  when  crossing  a 
bridge. 

Epilog.  The  father  of  one  of  my  students  is  an  engineer  who  worked  on  the 
construction  of  the  Bronx  Whitestone  Bridge  in  New  York  City.  He 
informed  me  that  the  original  plans  for  this  bridge  were  very  similar  to  those 
of  the  Tacoma  Bridge.  These  plans  were  hastily  redrawn  following  the 
collapse  of  the  Tacoma  Bridge. 

2.6.2  Electrical  networks 

We  now  briefly  study  a  simple  series  circuit,  as  shown  in  Figure  1  below. 
The  symbol  E  represents  a  source  of  electromotive  force.  This  may  be  a 
battery  or  a  generator  which  produces  a  potential  difference  (or  voltage), 
that  causes  a  current  /  to  flow  through  the  circuit  when  the  switch  S  is 
closed.  The  symbol  R  represents  a  resistance  to  the  flow  of  current  such  as 
that  produced  by  a  lightbulb  or  toaster.  When  current  flows  through  a  coil 
of  wire  L,  a  magnetic  field  is  produced  which  opposes  any  change  in  the 
current  through  the  coil.  The  change  in  voltage  produced  by  the  coil  is  pro¬ 
portional  to  the  rate  of  change  of  the  current,  and  the  constant  of  propor- 
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AAMA 

R 


+ 


L 


C 

Figure  1.  A  simple  series  circuit 


tionality  is  called  the  inductance  L  of  the  coil.  A  capacitor,  or  condenser, 
indicated  by  C,  usually  consists  of  two  metal  plates  separated  by  a  material 
through  which  very  little  current  can  flow.  A  capacitor  has  the  effect  of  re¬ 
versing  the  flow  of  current  as  one  plate  or  the  other  becomes  charged. 

Let  Q{t)  be  the  charge  on  the  capacitor  at  time  t.  To  derive  a  differen¬ 
tial  equation  which  is  satisfied  by  Q(t)  we  use  the  following. 


Kirchoffs  second  law :  In  a  closed  circuit,  the  impressed  voltage  equals  the 
sum  of  the  voltage  drops  in  the  rest  of  the  circuit. 

Now, 


(i)  The  voltage  drop  across  a  resistance  of  R  ohms  equals  RI  (Ohm’s 
law). 

(ii)  The  voltage  drop  across  an  inductance  of  L  henrys  equals  L(dl / dt). 

(iii)  The  voltage  drop  across  a  capacitance  of  C  farads  equals  Q/C . 


Hence, 


E(t)  =  L^  +  RI+  Q 


C* 


and  since  I{t)  —  dQ{t)/ dt ,  we  see  that 


d2Q  dQ  Q 

L — =-  +  R-j-  +  %  =  E  (t). 
dt2  dt  C  w 


Notice  the  resemblance  of  Equation  (1)  to  the  equation  of  a  vibrating 
mass.  Among  the  similarities  with  mechanical  vibrations,  electrical  circuits 
also  have  the  property  of  resonance.  Unlike  mechanical  systems,  though, 
resonance  is  put  to  good  use  in  electrical  systems.  For  example,  the  tuning 
knob  of  a  radio  is  used  to  vary  the  capacitance  in  the  tuning  circuit.  In  this 
manner,  the  resonant  frequency  (see  Exercise  13,  Section  2.6)  is  changed 
until  it  agrees  with  the  frequency  of  one  of  the  incoming  radio  signals.  The 
amplitude  of  the  current  produced  by  this  signal  will  be  much  greater  than 
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that  of  all  other  signals.  In  this  way,  the  tuning  circuit  picks  out  the  desired 
station. 

Exercises 

1.  Suppose  that  a  simple  series  circuit  has  no  resistance  and  no  impressed  voltage. 
Show  that  the  charge  Q  on  the  capacitor  is  periodic  in  time,  with  frequency 
co0  =  V 1  /  LC  .  The  quantity  Vl/LC  is  called  the  natural  frequency  of  the 
circuit. 

2.  Suppose  that  a  simple  series  circuit  consisting  of  an  inductor,  a  resistor  and  a 
capacitor  is  open,  and  that  there  is  an  initial  charge  Q0=  10" 8  coulombs  on  the 
capacitor.  Find  the  charge  on  the  capacitor  and  the  current  flowing  in  the  circuit 
after  the  switch  is  closed  for  each  of  the  following  cases. 

(a)  L  =  0.5  henrys,  C  =  10-5  farads,  R  =  1000  ohms 

(b)  L  =  1  henry,  C=  10“ 4  farads,  R  =  200  ohms 

(c)  L  =  2  henrys,  C=  10“ 6  farads,  i?  =  2000  ohms 

3.  A  simple  series  circuit  has  an  inductor  of  1  henry,  a  capacitor  of  10“ 6  farads, 
and  a  resistor  of  1000  ohms.  The  initial  charge  on  the  capacitor  is  zero.  If  a  12 
volt  battery  is  connected  to  the  circuit,  and  the  circuit  is  closed  at  /  =  0,  find  the 
charge  on  the  capacitor  1  second  later,  and  the  steady  state  charge. 

4.  A  capacitor  of  10“ 3  farads  is  in  series  with  an  electromotive  force  of  12  volts 
and  an  inductor  of  1  henry.  At  t  =  0,  both  Q  and  /  are  zero. 

(a)  Find  the  natural  frequency  and  period  of  the  electrical  oscillations. 

(b)  Find  the  maximum  charge  on  the  capacitor,  and  the  maximum  current  flow¬ 
ing  in  the  circuit. 

5.  Show  that  if  there  is  no  resistance  in  a  circuit,  and  the  impressed  voltage  is  of 
the  form  £0sinco/,  then  the  charge  on  the  capacitor  will  become  unbounded  as 

» oo  if  co=  V 1  /  LC  .  This  is  the  phenomenon  of  resonance. 

6.  Consider  the  differential  equation 

•  Q 

LQ  +  RQ+  -q  —  E0cosut.  (0 

We  find  a  particular  solution  xp(t)  of  (i)  as  the  real  part  of  a  particular  solution 

<t>(  0  of 

LQ  +  RQ+^-Eoe1"1.  (ii) 

(a)  Show  that 

= - 

R  +  z 

(b)  The  quantity  Z=  R  +  z'(wL  -  1  /coC)  is  known  as  the  complex  impedance  of 
the  circuit.  The  reciprocal  of  Z  is  called  the  admittance,  and  the  real  and  im¬ 
aginary  parts  of  1/Z  are  called  the  conductance  and  susceptance.  De¬ 
termine  the  admittance,  conductance  and  susceptance. 

7.  Consider  a  simple  series  circuit  with  given  values  of  L,  R  and  C,  and  an  im¬ 
pressed  voltage  For  which  value  of  oi  will  the  steady  state  current  be  a 

maximum? 
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2.7  A  model  for  the  detection  of  diabetes 

Diabetes  mellitus  is  a  disease  of  metabolism  which  is  characterized  by  too 
much  sugar  in  the  blood  and  urine.  In  diabetes,  the  body  is  unable  to  burn 
off  all  its  sugars,  starches,  and  carbohydrates  because  of  an  insufficient 
supply  of  insulin.  Diabetes  is  usually  diagnosed  by  means  of  a  glucose 
tolerance  test  (GTT).  In  this  test  the  patient  comes  to  the  hospital  after  an 
overnight  fast  and  is  given  a  large  dose  of  glucose  (sugar  in  the  form  in 
which  it  usually  appears  in  the  bloodstream).  During  the  next  three  to  five 
hours  several  measurements  are  made  of  the  concentration  of  glucose  in 
the  patient’s  blood,  and  these  measurements  are  used  in  the  diagnosis  of 
diabetes.  A  very  serious  difficulty  associated  with  this  method  of  diagnosis 
is  that  there  is  no  universally  accepted  criterion  for  interpreting  the  results 
of  a  glucose  tolerance  test.  Three  physicians  interpreting  the  results  of  a 
GTT  may  come  up  with  three  different  diagnoses.  In  one  case  recently,  in 
Rhode  Island,  one  physician,  after  reviewing  the  results  of  a  GTT,  came 
up  with  a  diagnosis  of  diabetes.  A  second  physician  declared  the  patient  to 
be  normal.  To  settle  the  question,  the  results  of  the  GTT  were  sent  to  a 
specialist  in  Boston.  After  examining  these  results,  the  specialist  concluded 
that  the  patient  was  suffering  from  a  pituitary  tumor. 

In  the  mid  1960’s  Drs.  Rosevear  and  Molnar  of  the  Mayo  Clinic  and 
Ackerman  and  Gatewood  of  the  University  of  Minnesota  discovered  a 
fairly  reliable  criterion  for  interpreting  the  results  of  a  glucose  tolerance 
test.  Their  discovery  arose  from  a  very  simple  model  they  developed  for 
the  blood  glucose  regulatory  system.  Their  model  is  based  on  the  following 
simple  and  fairly  well  known  facts  of  elementary  biology. 

1.  Glucose  plays  an  important  role  in  the  metabolism  of  any  vertebrate 
since  it  is  a  source  of  energy  for  all  tissues  and  organs.  For  each  individual 
there  is  an  optimal  blood  glucose  concentration,  and  any  excessive  devia¬ 
tion  from  this  optimal  concentration  leads  to  severe  pathological  condi¬ 
tions  and  potentially  death. 

2.  While  blood  glucose  levels  tend  to  be  autoregulatory,  they  are  also  in¬ 
fluenced  and  controlled  by  a  wide  variety  of  hormones  and  other  metabo¬ 
lites.  Among  these  are  the  following. 

(i)  Insulin ,  a  hormone  secreted  by  the  /?  cells  of  the  pancreas.  After  we 
eat  any  carbohydrates,  our  G.I.  tract  sends  a  signal  to  the  pancreas  to 
secrete  more  insulin.  In  addition,  the  glucose  in  our  blood  directly  stimu¬ 
lates  the  /?  cells  of  the  pancreas  to  secrete  insulin.  It  is  generally  believed 
that  insulin  facilitates  tissue  uptake  of  glucose  by  attaching  itself  to  the  im¬ 
permeable  membrane  walls,  thus  allowing  glucose  to  pass  through  the 
membranes  to  the  center  of  the  cells,  where  most  of  the  biological  and 
chemical  activity  takes  place.  Without  sufficient  insulin,  the  body  cannot 
avail  itself  of  all  the  energy  it  needs. 
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(ii)  Glucagon ,  a  hormone  secreted  by  the  a  cells  of  the  pancreas.  Any  ex¬ 
cess  glucose  is  stored  in  the  liver  in  the  form  of  glycogen.  In  times  of  need 
this  glycogen  is  converted  back  into  glucose.  The  hormone  glucagon  in¬ 
creases  the  rate  of  breakdown  of  glycogen  into  glucose.  Evidence  collected 
thus  far  clearly  indicates  that  hypoglycemia  (low  blood  sugar)  and  fasting 
promote  the  secretion  of  glucagon  while  increased  blood  glucose  levels 
suppress  its  secretion. 

(iii)  Epinephrine  (adrenalin),  a  hormone  secreted  by  the  adrenal  medulla. 
Epinephrine  is  part  of  an  emergency  mechanism  to  quickly  increase  the 
concentration  of  glucose  in  the  blood  in  times  of  extreme  hypoglycemia. 
Like  glucagon,  epinephrine  increases  the  rate  of  breakdown  of  glycogen 
into  glucose.  In  addition,  it  directly  inhibits  glucose  uptake  by  muscle 
tissue;  it  acts  directly  on  the  pancreas  to  inhibit  insulin  secretion;  and  it 
aids  in  the  conversion  of  lactate  to  glucose  in  the  liver. 

(iv)  Glucocorticoids ,  hormones  such  as  cortisol  which  are  secreted  by  the 
adrenal  cortex.  Glucocorticoids  play  an  important  role  in  the  metabolism 
of  carbohydrates. 

(v)  Thyroxin ,  a  hormone  secreted  by  the  thyroid  gland.  This  hormone 
aids  the  liver  in  forming  glucose  from  non-carbohydrate  sources  such  as 
glycerol,  lactate  and  amino  acids. 

(vi)  Growth  hormone  (somatotropin),  a  hormone  secreted  by  the  anterior 
pituitary  gland.  Not  only  does  growth  hormone  affect  glucose  levels  in  a 
direct  manner,  but  it  also  tends  to  “block”  insulin.  It  is  believed  that 
growth  hormone  decreases  the  sensitivity  of  muscle  and  adipose  membrane 
to  insulin,  thereby  reducing  the  effectiveness  of  insulin  in  promoting 
glucose  uptake. 

The  aim  of  Ackerman  et  al  was  to  construct  a  model  which  would  ac¬ 
curately  describe  the  blood  glucose  regulatory  system  during  a  glucose 
tolerance  test,  and  in  which  one  or  two  parameters  would  yield  criteria  for 
distinguishing  normal  individuals  from  mild  diabetics  and  pre-diabetics. 
Their  model  is  a  very  simplified  one,  requiring  only  a  limited  number  of 
blood  samples  during  a  GTT.  It  centers  attention  on  two  concentrations, 
that  of  glucose  in  the  blood,  labelled  G ,  and  that  of  the  net  hormonal  con¬ 
centration,  labelled  H .  The  latter  is  interpreted  to  represent  the  cumulative 
effect  of  all  the  pertinent  hormones.  Those  hormones  such  as  insulin  which 
decrease  blood  glucose  concentrations  are  considered  to  increase  H,  while 
those  hormones  such  as  cortisol  which  increase  blood  glucose  concentra¬ 
tions  are  considered  to  decrease  H.  Now  there  are  two  reasons  why  such  a 
simplified  model  can  still  provide  an  accurate  description  of  the  blood 
glucose  regulatory  system.  First,  studies  have  shown  that  under  normal,  or 
close  to  normal  conditions,  the  interaction  of  one  hormone,  namely  in¬ 
sulin,  with  blood  glucose  so  predominates  that  a  simple  “lumped  parame¬ 
ter  model”  is  quite  adequate.  Second,  evidence  indicates  that  normogly- 
cemia  does  not  depend,  necessarily,  on  the  normalcy  of  each  kinetic 
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mechanism  of  the  blood  glucose  regulatory  system.  Rather,  it  depends  on 
the  overall  performance  of  the  blood  glucose  regulatory  system,  and  this 
system  is  dominated  by  insulin-glucose  interactions. 

The  basic  model  is  described  analytically  by  the  equations 


^  =  FX{G,H)  +  J{t) 

f  -f2(g,h). 


(1) 

(2) 


The  dependence  of  Fx  and  F2  on  G  and  H  signify  that  changes  in  G  and  H 
are  determined  by  the  values  of  both  G  and  H.  The  function  J(t)  is  the  ex¬ 
ternal  rate  at  which  the  blood  glucose  concentration  is  being  increased. 
Now,  we  assume  that  G  and  H  have  achieved  optimal  values  G0  and  H0  by 
the  time  the  fasting  patient  has  arrived  at  the  hospital.  This  implies  that 
FX(G0,H0)  =  0  and  F2(Go,Ho)  =  0.  Since  we  are  interested  here  in  the  devia¬ 
tions  of  G  and  H  from  their  optimal  values,  we  make  the  substitution 

g=G—G0 ,  h  =  H—H0. 


Then, 


^  =  Fl(G0+g,H0+h)  +  J(t), 
^  =  F2(G0+g,H0+h). 


Now,  observe  that 


F\  ( + 8>  Ho  +  h)  ~  F\  ( G0,  )  + 

and 


3  FX{G0,H0)  3F,(G0,//0) 

g+ - - h  +  ex 


3  G 


3  H 


F2(G0+g,H0+h)  =  F2(G0,H0)  + 


3  F2(G0,H0)  dF2(G0,H0) 


3  G 


£  + 


3  H 


h  +  e * 


where  ex  and  e2  are  very  small  compared  to  g  and  h.  Hence,  assuming  that 
G  and  H  deviate  only  slightly  from  G0  and  H0 ,  and  therefore  neglecting 
the  terms  ex  and  e2 ,  we  see  that 


dg 

dt 


3F,(G0,/f0)  3F,(Go,H0) 

g+ - ^ - h  +  J{t) 


3  G 


3  H 


dh_*  F2(G0,H0)  3  F2(G0,H0) 

dt  3  G  8+  3  H 


Now,  there  are  no  means,  a  priori,  of  determining  the  numbers 
3Fi(G0,tf0)  ^(G0,H0)  3  F2(G0,H0)  3  F2(G0,H0) 

3  G  ’  3  H  '  3  G  and  3  H 


However,  we  can  determine  their  signs.  Referring  to  Figure  1,  we  see  that 
dg/ dt  is  negative  for  g>0  and  h  =  0,  since  the  blood  glucose  concentration 
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Figure  1.  Simplified  model  of  the  blood  glucose  regulatory  system 


will  be  decreasing  through  tissue  uptake  of  glucose  and  the  storing 
of  excess  glucose  in  the  liver  in  the  form  of  glycogen.  Consequently 
0/r1(Go,//o)/3G  must  be  negative.  Similarly,  9/r1(G0,//0)/3//  is  negative 
since  a  positive  value  of  h  tends  to  decrease  blood  glucose  levels  by  facili¬ 
tating  tissue  uptake  of  glucose  and  by  increasing  the  rate  at  which  glucose 
is  converted  to  glycogen.  The  number  dF2(G0,  H0)/dG  must  be  positive 
since  a  positive  value  of  g  causes  the  endocrine  glands  to  secrete  those 
hormones  which  tend  to  increase  H.  Finally,  dF2(G0,H0)/dH  must  be 
negative,  since  the  concentration  of  hormones  in  the  blood  decreases 
through  hormone  metabolism. 

Thus,  we  can  write  Equations  (3)  and  (4)  in  the  form 


—  =  -mlg-m2h  +  J  ( t ) 
^  =  -m3h  +  m4g 


(5) 

(6) 


where  mv  m2 ,  m3 ,  and  m4  are  positive  constants.  Equations  (5)  and  (6)  are 
two  first-order  equations  for  g  and  h.  However,  since  we  only  measure  the 
concentration  of  glucose  in  the  blood,  we  would  like  to  remove  the  vari¬ 
able  h.  This  can  be  accomplished  as  follows:  Differentiating  (5)  with  re¬ 
spect  to  t  gives 


dJ_ 
dt  ’ 


Substituting  for  dh  /  dt  from  (6)  we  obtain  that 


dt 


2 


+  m2m3h  —  m2m4  g  + 


dj_ 
dt  ' 


Next,  observe  from  (5)  that  m2h  =  (-  dg/ dt)- mxg  +  J (/).  Consequently, 
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g(t)  satisfies  the  second-order  linear  differential  equation 


+  (w,  +  w3) 


dg 

dt 


+  {mxm3  +  m2m4)  g  =  m3J  + 


dJ_ 
dt  ' 


We  rewrite  this  equation  in  the  form 


^f+2«§+coo2g=S(0 


where  a  =  (m]  +  m3)/2,  o)q  =  mxm3  +  m2m4,  and  S  (t)  =  m3J  +  dJ / dt. 

Notice  that  the  right-hand  side  of  (8)  is  identically  zero  except  for  the 
very  short  time  interval  in  which  the  glucose  load  is  being  ingested.  We  will 
learn  to  deal  with  such  functions  in  Section  2.12.  For  our  purposes  here,  let 
/  =  0  be  the  time  at  which  the  glucose  load  has  been  completely  ingested. 
Then,  for  t  >  0,  g(t)  satisfies  the  second-order  linear  homogeneous  equa¬ 
tion 

+2«§+“oS  =  °-  (9) 


This  equation  has  positive  coefficients.  Hence,  by  the  analysis  in  Section 
2.6,  (see  also  Exercise  8,  Section  2.2.2)  g(t)  approaches  zero  as  t  ap¬ 
proaches  infinity.  Thus  our  model  certainly  conforms  to  reality  in  pre¬ 
dicting  that  the  blood  glucose  concentration  tends  to  return  eventually  to 
its  optimal  concentration. 

The  solutions  g(t)  of  (9)  are  of  three  different  types,  depending  as  to 
whether  a2  —  cog  is  positive,  negative,  or  zero.  These  three  types,  of  course, 
correspond  to  the  overdamped,  critically  damped  and  underdamped  cases 
discussed  in  Section  2.6.  We  will  assume  that  a2-<Og  is  negative;  the  other 
two  cases  are  treated  in  a  similar  manner.  If  a2  —  <Og<0,  then  the  character¬ 
istic  equation  of  Equation  (9)  has  complex  roots.  It  is  easily  verified  in  this 
case  (see  Exercise  1)  that  every  solution  g(t)  of  (9)  is  of  the  form 

g(t)  =  Ae~atcos(cot  —  §),  co2  =  (Og  —  a2.  (10) 

Consequently, 


G(t)  =  G0  +  Ae~at  cos(o)t  —  8).  (11) 

Now  there  are  five  unknowns  G0,  A,  a,  <o0,  and  8  in  (11).  One  way  of  de¬ 
termining  them  is  as  follows.  The  patient’s  blood  glucose  concentration  be¬ 
fore  the  glucose  load  is  ingested  is  G0.  Hence,  we  can  determine  G0  by 
measuring  the  patient’s  blood  glucose  concentration  immediately  upon  his 
arrival  at  the  hospital.  Next,  if  we  take  four  additional  measurements  Gv 
G2 ,  G3,  and  G4  of  the  patient’s  blood  glucose  concentration  at  times  tv  t2, 
t3 ,  and  /4,  then  we  can  determine  A ,  a,  <o0,  and  5  from  the  four  equations 

Gj  =  G0  +  A e  ~ atJ cos (co^  —  8 ),  j  —  1 , 2, 3, 4. 

A  second,  and  better  method  of  determining  G0,  Ay  a,  <o0,  and  8  is  to  take  n 
measurements  GVG2, ...,Gn  of  the  patient’s  blood  glucose  concentration  at 
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times  tXit2,...,tn.  Typically  n  is  6  or  7.  We  then  find  optimal  values  for  G& 
A ,  a,  co0,  and  S  such  that  the  least  square  error 

n 

E=  2  [Gy—  G0  —  Ae~atjcos(cotj—  5)]2 
7=1 

is  minimized.  The  problem  of  minimizing  E  can  be  solved  on  a  digital 
computer,  and  Ackerman  et  al  (see  reference  at  end  of  section)  provide  a 
complete  Fortran  program  for  determining  optimal  values  for  G0,  A ,  a,  co0, 
and  S.  This  method  is  preferrable  to  the  first  method  since  Equation  (1 1)  is 
only  an  approximate  formula  for  G(t).  Consequently,  it  is  possible  to  find 
values  G0 ,  A,  a ,  co0,  and  S  so  that  Equation  (11)  is  satisfied  exactly  at  four 
points  /j,  t2 ,  /3,  and  t4  but  yields  a  poor  fit  to  the  data  at  other  times.  The 
second  method  usually  offers  a  better  fit  to  the  data  on  the  entire  time  in¬ 
terval  since  it  involves  more  measurements. 

In  numerous  experiments,  Ackerman  et  al  observed  that  a  slight  error  in 
measuring  G  could  produce  a  very  large  error  in  the  value  of  a.  Hence,  any 
criterion  for  diagnosing  diabetes  that  involves  the  parameter  a  is  unreli¬ 
able.  However,  the  parameter  co0,  the  natural  frequency  of  the  system,  was 
relatively  insensitive  to  experimental  errors  in  measuring  G.  Thus,  we  may 
regard  a  value  of  co0  as  the  basic  descriptor  of  the  response  to  a  glucose 
tolerance  test.  For  discussion  purposes,  it  is  more  convenient  to  use  the 
corresponding  natural  period  r0  =  27r/co0.  The  remarkable  fact  is  that  data 
from  a  variety  of  sources  indicated  that  a  value  of  less  than  four  hours  for 
T0  indicated  normalcy ,  while  appreciably  more  than  four  hours  implied  mild 
diabetes. 

Remark  1.  The  usual  period  between  meals  in  our  culture  is  about  4  hours. 
This  suggests  the  interesting  possibility  that  sociological  factors  may  also 
play  a  role  in  the  blood  glucose  regulatory  system. 

Remark  2.  We  wish  to  emphasize  that  the  model  described  above  can  only 
be  used  to  diagnose  mild  diabetes  or  pre-diabetes,  since  we  have  assumed 
throughout  that  the  deviation  g  of  G  from  its  optimal  value  G0  is  small. 
Very  large  deviations  of  G  from  G0  usually  indicate  severe  diabetes  or  di¬ 
abetes  insipidus,  which  is  a  disorder  of  the  posterior  lobe  of  the  pituitary 
gland. 

A  serious  shortcoming  of  this  simplified  model  is  that  it  sometimes 
yields  a  poor  fit  to  the  data  in  the  time  period  three  to  five  hours  after  in¬ 
gestion  of  the  glucose  load.  This  indicates,  of  course,  that  variables  such  as 
epinephrine  and  glucagon  play  an  important  role  in  this  time  period.  Thus 
these  variables  should  be  included  as  separate  variables  in  our  model, 
rather  than  being  lumped  together  with  insulin.  In  fact,  evidence  indicates 
that  levels  of  epinephrine  may  rise  dramatically  during  the  recovery  phase 
of  the  GTT  response,  when  glucose  levels  have  been  lowered  below  fasting 
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levels.  This  can  also  be  seen  directly  from  Equation  (9).  If  a2-<o q>0,  then 
g(t)  may  have  the  form  described  in  Figure  2.  Note  that  g(t)  drops  very 
rapidly  from  a  fairly  high  value  to  a  negative  one.  It  is  quite  conceivable, 
therefore,  that  the  body  will  interpret  this  as  an  extreme  emergency  and 
thereby  secrete  a  large  amount  of  epinephrine. 

Medical  researchers  have  long  recognized  the  need  of  including  epi¬ 
nephrine  as  a  separate  variable  in  any  model  of  the  blood  glucose  regula¬ 
tory  system.  However,  they  were  stymied  by  the  fact  that  there  was  no  reli¬ 
able  method  of  measuring  the  concentration  of  epinephrine  in  the  blood. 
Thus,  they  had  to  assume,  for  all  practical  purposes,  the  the  level  of  epi¬ 
nephrine  remained  constant  during  the  course  of  a  glucose  tolerance  test. 
This  author  has  just  been  informed  that  researchers  at  Rhode  Island 
Hospital  have  devised  an  accurate  method  of  measuring  the  concentration 
of  epinephrine  in  the  blood.  Thus  we  will  be  able  to  develop  and  test  more 
accurate  models  of  the  blood  glucose  regulatory  system.  Hopefully,  this 
will  lead  to  more  reliable  criteria  for  the  diagnosis  of  diabetes. 

Reference 

E.  Ackerman,  L.  Gatewood,  J.  Rosevear,  and  G.  Molnar,  Blood  glucose  regulation 
and  diabetes,  Chapter  4  in  Concepts  and  Models  of  Biomathematics ,  F.  Heinmets, 
ed.,  Marcel  Dekker,  1969,  131-156. 

Exercises 

1.  Derive  Equation  (10). 

2.  A  patient  arrives  at  the  hospital  after  an  overnight  fast  with  a  blood  glucose  con¬ 
centration  of  70  mg  glucose/ 100  ml  blood  (mg  glucose/ 100  ml  blood  = 
milligrams  of  glucose  per  100  milliliters  of  blood).  His  blood  glucose  concentra¬ 
tion  1  hour,  2  hours,  and  3  hours  after  fully  absorbing  a  large  amount  of  glucose 
is  95,  65,  and  75  mg  glucose/ 100  ml  blood,  respectively.  Show  that  this  patient  is 
normal.  Hint :  In  the  underdamped  case,  the  time  interval  between  two  succes¬ 
sive  zeros  of  G  —  G0  exceeds  one  half  the  natural  period. 
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According  to  a  famous  diabetologist,  the  blood  glucose  concentration  of  a 
nondiabetic  who  has  just  absorbed  a  large  amount  of  glucose  will  be  at  or 
below  the  fasting  level  in  2  hours  or  less.  Exercises  3  and  4  compare  the 
diagnoses  of  this  diabetologist  with  those  of  Ackerman  et  al. 

3.  The  deviation  g(/)  of  a  patient’s  blood  glucose  concentration  from  its  optimal 
concentration  satisfies  the  differential  equation  (d2g/ dt2)  +  2a(dg/ dt)  + a2g  =  0 
immediately  after  he  fully  absorbs  a  large  amount  of  glucose.  The  time  /  is 
measured  in  minutes,  so  that  the  units  of  a  are  reciprocal  minutes.  Show  that 
this  patient  is  normal  according  to  Ackerman  et  al,  if  a  >  77/ 120  (min),  and  that 
this  patient  is  normal  according  to  the  famous  diabetologist  if 

s'(0X-(4+«)s(0). 

4.  A  patient’s  blood  glucose  concentration  G(t )  satisfies  the  initial-value  problem 

d2G  [  1  dG  [  1  c 

dt2  20  (min)  dt  2500  (min)2 

= - - - -75  mg  glucose/ 100  ml  blood; 

2500  (min) 

G  (0)  =  150  mg  glucose/ 100  ml  blood, 

1  1—  4^18/5 

G'(0)=  —  aG  (0)/(min);  a  >  ^  _  ,g/5 

immediately  after  he  fully  absorbs  a  large  amount  of  glucose.  This  patient’s  opti¬ 
mal  blood  glucose  concentration  is  75  mg  glucose/ 100  ml  blood.  Show  that  this 
patient  is  a  diabetic  according  to  Ackerman  et  al,  but  is  normal  according  to  the 
famous  diabetologist. 


2.8  Series  solutions 

We  return  now  to  the  general  homogeneous  linear  second-order  equation 

L[y]  =  p{t)^  +  Q{t)^  +  R{t)y= 0  (l) 

with  P(t)  unequal  to  zero  in  the  interval  a  <  t  <  /?.  It  was  shown  in  Section 
2.1  that  every  solution y{t)  of  (1)  can  be  written  in  the  form^(/)  =  c1  y\(t) 
+  c2^2(0?  where  yx{t)  and y2(t)  are  any  two  linearly  independent  solutions 
of  (1).  Thus,  the  problem  of  finding  all  solutions  of  (1)  is  reduced  to  the 
simpler  problem  of  finding  just  two  solutions.  In  Section  2.2  we  handled 
the  special  case  where  P,  Q ,  and  R  are  constants.  The  next  simplest  case  is 
when  P(t ),  Q(t),  and  R(t)  are  polynomials  in  t.  In  this  case,  the  form  of 
the  differential  equation  suggests  that  we  guess  a  polynomial  solution  y(t) 
of  (1).  If  y(t)  is  a  polynomial  in  t ,  then  the  three  functions  P{t)y"(t ), 
Q(t)y\t),  and  R(t)y(t)  are  again  polynomials  in  t.  Thus,  in  principle,  we 
can  determine  a  polynomial  solution  y(t)  of  (1)  by  setting  the  sums  of  the 
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coefficients  of  like  powers  of  t  in  the  expression  L[y](t)  equal  to  zero.  We 
illustrate  this  method  with  the  following  example. 


Example  1.  Find  two  linearly  independent  solutions  of  the  equation 


—  2y  =0. 


Solution.  We  will  try  to  find  2  polynomial  solutions  of  (2).  Now,  it  is  not 
obvious,  a  priori,  what  the  degree  of  any  polynomial  solution  of  (2)  should 
be.  Nor  is  it  evident  that  we  will  be  able  to  get  away  with  a  polynomial  of 
finite  degree.  Therefore,  we  set 


y{t)  =  a0+ axt  + a2t2+  ...  =  2 

n=  o 

Computing 

dy  00 

—  =  ax  +  2a2t  +  3a3t2  +  ...  =  2  nantn  1 

*  =  o 

and 

d2y 

— -  =  2a2  +  6a3t  +  ...  =  2  n(n~\)antn  2> 

n=  0 


we  see  thaty(/)  is  a  solution  of  (2)  if 

L[>’](0=  S  «(«“  l)a„/"_2-2r  2  na„tn~'-2  2  V" 

n  =  0  n  —0  a?  =  0 

OO  00  00 

=  2  "("“OV-2-2  2  «an/n-2  2  antn  =  0.  (3) 

n  =  0  «  =  0  n  =  0 

Our  next  step  is  to  rewrite  the  first  summation  in  (3)  so  that  the  expo¬ 
nent  of  the  general  term  is  n ,  instead  of  n  —  2.  This  is  accomplished  by  in¬ 
creasing  every  n  underneath  the  summation  sign  by  2,  and  decreasing  the 
lower  limit  by  2,  that  is, 


00  00 

2  n{n- l)antn~2=  2  (*  +  2)(n+  \)an+2t". 

n  =  0  n  =  —  2 

(If  you  don’t  believe  this,  you  can  verify  it  by  writing  out  the  first  few 
terms  in  both  summations.  If  you  still  don’t  believe  this  and  want  a  formal 
proof,  set  m  =  n  —  2.  When  n  is  zero,  m  is  —  2  and  when  n  is  infinity,  m  is 
infinity.  Therefore 

00  00 

2  n(n- \)antn~2=  2  (m  +  2)(m+ l)am+2tm, 

n  =  0  m  =  —2 

and  since  m  is  a  dummy  variable,  we  may  replace  it  by  n.)  Moreover,  ob¬ 
serve  that  the  contribution  to  this  sum  from  n  ==  —  2  and  n  =  —  1  is  zero 
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since  the  factor  ( n  +  2)(n+  1)  vanishes  in  both  these  instances.  Hence, 

00  00 

2  n(n  —  \)antn~2=  2  (n  +  2)(«  +  l)W" 

n=0  n— 0 

and  we  can  rewrite  (3)  in  the  form 

2  (n  +  2)(n  +  \)an+2t"  —  2  2  nant"-2  2  antn  =  0.  (4) 

n  —  0  n  =  0  n  =  0 

Setting  the  sum  of  the  coefficients  of  like  powers  of  t  in  (4)  equal  to  zero 
gives 

(n  +  2){n+  l)an+2-2na„-2a„  =  0 

so  that 

2(n+l)a„  _  2 a„ 

an  +  2  /M  .  .  i  \  P) 


(n  +  2)(n+  1)  az  +  2  ‘ 


Equation  (5)  is  a  recurrence  formula  for  the  coefficients  a0,aXfa2,a3,... . 
The  coefficient  an  determines  the  coefficient  an+2.  Thus,  a0  determines  a2 
through  the  relation  a2  =  2a0/2  =  a0 ;  a2 ,  in  turn,  determines  a4  through  the 
relation  a4  =  2a2/(2  +  2)  =  a0/2;  and  so  on.  Similarly,  ax  determines  a3 
through  the  relation  a3  =  2ax/(2  +  \)  =  2ax/3;  a3 ,  in  turn,  determines  a5 
through  the  relation  a5  =  2a3/(3  +  2)  =  4a1/3*5;  and  so  on.  Consequently, 
all  the  coefficients  are  determined  uniquely  once  aQ  and  ax  are  prescribed. 
The  values  of  a0  and  ax  are  completely  arbitrary.  This  is  to  be  expected, 
though,  for  if 

y  (/)  =  a0+  axt  +  a2t2+  ... 

then  the  values  of  y  andy'  at  /  =  0  are  a0  and  ax  respectively.  Thus,  the 
coefficients  a0  and  ax  must  be  arbitrary  until  specific  initial  conditions  are 
imposed  on  >>. 

To  find  two  solutions  of  (2),  we  choose  two  different  sets  of  values  of  a0 
and  ax.  The  simplest  possible  choices  are  (i)  a0=  l,ax  =  0;  (ii)  a0  =  0,  ax  =  1. 

(i)  a0=  1,  ax=  0. 

In  this  case,  all  the  odd  coefficients  ax,a3,a5,...  are  zero  since  a3  =  2ax/3  = 
0,  a5  =  2a3/5  =  0,  and  so  on.  The  even  coefficients  are  determined  from  the 
relations 


a2  =  a0=\, 


a4  = 


a6  = 


2  a 4 


1 

2-3  ’ 


and  so  on.  Proceeding  inductively,  we  find  that 


= 


2-3* • • n  n! 


Hence, 


2,  t  it  ,  ,2 


yx(t)  =  \  +  t  +2[+3[  +  ---=e 
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is  one  solution  of  (2). 

(ii)  <*o=0,  a,  =  1. 

In  this  case,  all  the  even  coefficients  are  zero,  and  the  odd  coefficients  are 
determined  from  the  relations 

_ 2ai  _  2  _  _  2a3  _  2  2  _  2a5  _  2  2  2 

^3  ^  ^  5  ^5  ^  ^  3  >  #7  ""  y  ”  y  ^  ^  ) 

and  so  on.  Proceeding  inductively,  we  find  that 


a 


2* 


2aj+  1 


Thus, 


3-5- •  •  (2/i  +  l)  * 


,  ,  2 13  ,  2V  ,  V  2"tZn+l 

—  +  XT  +  --2 ,3.5...  (2„ 

/7=0  V 


(2rt  +  1) 


is  a  second  solution  of  (2). 


Now,  observe  that  and  y2(0  are  polynomials  of  infinite  degree, 
even  though  the  coefficients  Pit)  —  1>  Q(0~  “2f,  and  R(t)  =  -  2  are  poly¬ 
nomials  of  finite  degree.  Such  polynomials  are  called  power  series.  Before 
proceeding  further,  we  will  briefly  review  some  of  the  important  properties 
of  power  series. 

1.  An  infinite  series 

00 

y(t)  =  a0+al(t-t0)  +  a2(t-t0f+ ...=  2an(/_'o)"  (6) 

n  =  0 

is  called  a  power  series  about  t  —  tQ. 

2.  All  power  series  have  an  interval  of  convergence.  This  means  that  there 
exists  a  nonnegative  number  p  such  that  the  infinite  series  (6)  converges 
for  1 1  — 10 1  <  p,  and  diverges  for  1 1  — 10  \  >  p.  The  number  p  is  called  the 
radius  of  convergence  of  the  power  series. 

3.  The  power  series  (6)  can  be  differentiated  and  integrated  term  by  term, 
and  the  resultant  series  have  the  same  interval  of  convergence. 

4.  The  simplest  method  (if  it  works)  for  determining  the  interval  of  conver¬ 
gence  of  the  power  series  (6)  is  the  Cauchy  ratio  test.  Suppose  that  the 
absolute  value  of  an+l/an  approaches  a  limit  A  as  n  approaches  infinity. 
Then,  the  power  series  (6)  converges  for  1 1—  f0|  <  1  /A,  and  diverges  for 
|/-  f0|  >  1/A. 

5.  The  product  of  two  power  series  2  toT  anc*  ^(*”*0)" 

is  again  a  power  series  of  the  form  2  *o)">  cn  =  a <Ai  + 

axbn_  1  +  . . .  +  anb0.  The  quotient 

aQ+alt  +  a2t2  +  ... 

b§  4“  b  1 1 4-  b2t 2  +  . . . 

of  two  power  series  is  again  a  power  series,  provided  that  b0¥=  0. 
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6.  Many  of  the  functions  f(t)  that  arise  in  applications  can  be  expanded  in 
power  series;  that  is,  we  can  find  coefficients  a0,ava2,".  so  that 

00 

/(O  =  a0+al('-*o)  +  a2(*-'o)2+  •••  =  o)”  (7) 

n  =  0 

Such  functions  are  said  to  be  analytic  at  t  =  /0,  and  the  series  (7)  is  called 
the  Taylor  series  of  /about  t=  t0.  It  can  easily  be  shown  that  if  /  admits 
such  an  expansion,  then,  of  necessity,  an=f(n\t0)/n\9  where  f^n\t)  — 
dnf{f)/dtn. 

7.  The  interval  of  convergence  of  the  Taylor  series  of  a  function  /(/),  about 
t09  can  be  determined  directly  through  the  Cauchy  ratio  test  and  other 
similar  methods,  or  indirectly,  through  the  following  theorem  of  complex 
analysis. 

Theorem  6.  Let  the  variable  t  assume  complex  values ,  and  let  z0  be  the  point 
closest  to  t0  at  which  f  or  one  of  its  derivatives  fails  to  exist.  Compute  the 
distance  p,  in  the  complex  plane ,  between  t0  and  z0.  Then ,  the  Taylor  series 
of  f  about  t0  converges  for  |  l  'ol<P>  and  diverges  for  1 1  *ol 

As  an  illustration  of  Theorem  6,  consider  the  function  f(t)  =  1/(1  + 12). 
The  Taylor  series  of  /  about  t  =  0  is 

- -  —  1  —  t2  1*  —  /6  +  ..., 

1  +  /2 

and  this  series  has  radius  of  convergence  one.  Although  the  function 
(l  +  r2)-1  is  perfectly  well  behaved  for  t  real,  it  goes  to  infinity  when  t  =  ±i9 
and  the  distance  of  each  of  these  points  from  the  origin  is  one. 

A  second  application  of  Theorem  6  is  that  the  radius  of  convergence  of 
the  Taylor  series  about  t  =  0  of  the  quotient  of  two  polynomials  a(t)  and 
b(t)9  is  the  magnitude  of  the  smallest  zero  of  b(t). 

At  this  point  we  make  the  important  observation  that  it  really  wasn’t 
necessary  to  assume  that  the  functions  P(t\  Q(t ),  and  R  (t)  in  (1)  are  poly¬ 
nomials.  The  method  used  to  solve  Example  1  should  also  be  applicable  to 
the  more  general  differential  equation 

L[y]  =  P(t)^  +  Q(t)^  +  R(t)y  =  0 

where  P(t)9  Q(t ),  and  R(t )  are  power  series  about  tQ.  (Of  course,  we  would 
expect  the  algebra  to  be  much  more  cumbersome  in  this  case.)  If 

P(t)=Po+P\{t-t0)+  ,  Q(t)  =  q0+ql(t-t0)+  , 

P(t)  =  r0+rl(t-t0)+  ... 

andy(f)  =  a0  -f  aft  — 10)  +  ...  9  then  L[y](t )  will  be  the  sum  of  three  power 
series  about  t  =  t0.  Consequently,  we  should  be  able  to  find  a  recurrence 
formula  for  the  coefficients  an  by  setting  the  sum  of  the  coefficients  of  like 
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powers  of  t  in  the  expression  L[y](t)  equal  to  zero.  This  is  the  content  of 
the  following  theorem,  which  we  quote  without  proof. 

Theorem  7.  Let  the  functions  Q(t)/P(t)  and  R(t)/P(t)  have  convergent 
Taylor  series  expansions  about  t=t0,for  \t  —  t0\<p.  Then ,  every  solution 
y{t)  of  the  differential  equation 

P(t)^f  +  Q{t)^-t+R{t)y=0  (8) 

is  analytic  at  t=  t0,  and  the  radius  of  convergence  of  its  Taylor  series  ex¬ 
pansion  about  t  — t0  is  at  least  p.  The  coefficients  a2,a3, in  the  Taylor 
series  expansion 

y(t)  =  a0+al(t-t0)  +  a2(t-t0)2+ ...  (9) 

are  determined  by  plugging  the  series  (9)  into  the  differential  equation  (8) 
and  setting  the  sum  of  the  coefficients  of  like  powers  of  t  in  this  expression 
equal  to  zero. 

Remark.  The  interval  of  convergence  of  the  Taylor  series  expansion  of  any 
solution  y{t)  of  (8)  is  determined,  usually,  by  the  interval  of  convergence  of 
the  power  series  Q(t)/ P(t)  and  R(t)/P(t ),  rather  than  by  the  interval  of 
convergence  of  the  power  series  P(t ),  Q(t ),  and  R(t).  This  is  because  the 
differential  equation  (8)  must  be  put  in  the  standard  form 

^7+P(»)f  +«(<b’=° 

whenever  we  examine  questions  of  existence  and  uniqueness. 


Example  2. 

(a)  Find  two  linearly  independent  solutions  of 


d2y  .  3t  dy  .  l  =n 

dt2  1  +  t*dt  i  +  t2* 


(b)  Find  the  solution  y(t)  of  (10)  which  satisfies  the  initial  conditions  >>(0) 
=  2,/(0)  =  3. 

Solution. 

(a)  The  wrong  way  to  do  this  problem  is  to  expand  the  functions  3t/(l  + 12) 
and  1/(1  +  t2)  in  power  series  about  t  —  0.  The  right  way  to  do  this  problem 
is  to  multiply  both  sides  of  (10)  by  1  + 12  to  obtain  the  equivalent  equation 

We  do  the  problem  this  way  because  the  algebra  is  much  less  cumbersome 
when  the  coefficients  of  the  differential  equation  (8)  are  polynomials  than 
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when  they  are  power  series.  Setting  y{t)—  2T=o ant">  we  comPute 

OO  00  00 

^[>,](0  =  (1  +  ?2)  2  n(n-l)a„tn~2  +  3t  2  na„tn~l+  2  V" 

n  =  0  n- 0  n  =  0 

00  00 

=  2  n(n-l)a„t"~2  +  2  [«(«- l)  +  3n  +  l]a„t'’ 

«=0  n=0 

=  2  (n  +  2)(n  +  l)a„+2t"  +  2  («  +  1)2«n/'’- 

/j  =  0  n  =  0 


Setting  the  sum  of  the  coefficients  of  like  powers  of  t  equal  to  zero  gives 
(rt  +  2)(«+  \)an+2  +  (fl  +  \)2an  =  Q.  Hence, 


(”+l  )\ 

(n  +  2)(n  + 1) 


(n+\)a„ 
n  +  2 


Equation  (11)  is  a  recurrence  formula  for  the  coefficients  a2,tf3,...  in  terms 
of  a0  and  av  To  find  two  linearly  independent  solutions  of  (10),  we  choose 
the  two  simplest  cases  (i)  a0=  1,  ax  =  0;  and  (ii)  a0  =  0,  ax  =  1. 

(i)  a0=  1,  ax  =  0. 


In  this  case,  all  the  odd  coefficients  are  zero  since  a3=  —2al/3  =  0,  a5  = 
-4a3/5  =  0,  and  so  on.  The  even  coefficients  are  determined  from  the  rela¬ 
tions 

ao  1  3a2^1-3  _  5aA  _  1-3-5 


and  so  on.  Proceeding  inductively,  we  find  that 


Thus, 


y 


1  ‘3~  •  •  (2/i—  1) 

a2"  (  2-4- • -2  n  ^  ^  2"n! 


■2  1-3  a.  V»  ,  1*3 —  (2»—l) 


i(0  =  1_  y  +  ••• =  2  (_1) 


f2/i 


*  =  0 


2"n! 


(12) 


is  one  solution  of  (10).  The  ratio  of  the  (n  +  l)st  term  to  the  nth  term  of 

yM  is 

1  -3  -  •  •  (2n—  l)(2n+  l)/2n+2  2 V  ~{2n+\)t 2 

2"+1(n+  1)!  1-3-  •  •  (2n  —  l)r2n  _  2(n  +  l)  ’ 

and  the  absolute  value  of  this  quantity  approaches  t2  as  n  approaches  in¬ 
finity.  Hence,  by  the  Cauchy  ratio  test,  the  infinite  series  (12)  converges  for 
|?|  <  1,  and  diverges  for  |r|  >  1. 

(ii)  a0  =  0,  a,  =  l. 
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In  this  case,  all  the  even  coefficients  are  zero,  and  the  odd  coefficients  are 
determined  from  the  relations 

2a,  2  _  4a3  _  2-4  _  6as  _  2-4-6 

and  so  on.  Proceeding  inductively,  we  find  that 


a2«+l 


2-4- •  •  2n 
3*5* • • (2rt + 1) 


(-1  )n2nn\ 
3*5*  •  •  (2«  + 1) 


Thus, 


2  3.  2-4.5 


00 


...-2 


/?~0 


3*5*  •  •  (2«  +  1) 


t 


2n+l 


is  a  second  solution  of  (10),  and  it  is  easily  verified  that  this  solution,  too, 
converges  for  |/|<  1,  and  diverges  for  |/|>  1.  This,  of  course,  is  not  very 
surprising,  since  the  Taylor  series  expansions  about  t  =  0  of  the  functions 
3//(l  + 12)  and  1  /(I  +  t 2)  only  converge  for  |*|  <  1. 

(b)  The  solution yx(t)  satisfies  the  initial  conditions  y(0)  =  l,y'(0)  =  0>  while 
y2(t)  satisfies  the  initial  conditions >>(0)  =  0, y'(0)  = 1.  Hence y(t)  =  2yx(t)  + 

3 y2(0- 


Example  3.  Solve  the  initial-value  problem 

L[y]=^+t2^+2ty=0>  ^(°)=1>  /(o)-o. 


Solution.  Setting y(t)  =  ^™_0antn,  we  compute 

L[y]{t)=  2  n(n-l)ant"~2+ 12  2  na„tn~l  +  2t  2  V" 

n®0  /?■*  0  n  =  0 

=  2  n(n-l)a„tn~2+  2  nantn+l  +  2  2  a„t',+l 

/!■  0  n=*Q  n**  0 

=  2  n(n-l)a„tn~2+  2  (n  +  2)a„t”+1. 

«  =  0  /?  =  0 

Our  next  step  is  to  rewrite  the  first  summation  so  that  the  exponent  of  the 
general  term  is  n  + 1  instead  of  n  —  2.  This  is  accomplished  by  increasing 
every  n  underneath  the  summation  sign  by  3,  and  decreasing  the  lower 
limit  by  3;  that  is, 

2  «(«-l)a„r"-2=  2  («  +  3)(«  +  2)an+3r"+1 

/i  *■  0  n  *  —  3 

=  2  («  +  3)(«  +  2)an+3r"+1. 

— i 
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Therefore, 


00 


00 


L[y]{t)=  2  («  +  3)(«  +  2 K+3/n+1+  2  (n  +  2)a„t 

n  —  —  1  «  =  0 


n+  1 


oo 


00 


=  2a2  +  2  («  +  3)(«  +  2)aw+3//l+1+  ^  (fl +  2)a/l/'!+1. 

«=0  n=0 

Setting  the  sums  of  the  coefficients  of  like  powers  of  t  equal  to  zero  gives 
2a2  =  0,  and  ( n  +  3)(n  +  2)an+3  +  (n  +  2)an  =  0 ;  n  —  0,1,2,... 

Consequently, 


a2  =  0,  and  an+3=  ~ 


a 


n 


n  +  3  ’ 


n>  0. 


(14) 


The  recurrence  formula  (14)  determines  a3  in  terms  of  a0 ,  a4  in  terms  of 
al9  a5  in  terms  of  a2 ,  and  so  on.  Since  a2  =  0,  we  see  that  a5,as,alv...  are  all 
zero,  regardless  of  the  values  of  a0  and  av  To  satify  the  initial  conditions, 
we  set  a0=  1  and  ax=Q.  Then,  from  (14),  a4,al9al0,...  are  all  zero,  while 


a 


o 


1 


a , 


1 


Cl3  A  A  )  ^  A  J 

J  3  o  J  *  o 


1 


Qg  9  3-6-9 


and  so  on.  Proceeding  inductively,  we  find  that 

(-1)"  (“I)"  (-0 


n 


«3»  = 


3*6  -  •  3 n  3"  1  * 2  -  -  « 


3”/i! 


Hence, 


00 


1  t3  .  t6  t9  ,  _  V  ^  ?3" 

3  +  3-6  3-6-9  +"  ^  3  V 

n  =  0 


By  Theorem  7,  this  series  converges  for  all  t ,  since  the  power  series  /2  and 
2/  obviously  converge  for  all  t.  (We  could  also  verify  this  directly  using  the 
Cauchy  ratio  test.) 


Example  4.  Solve  the  initial-value  problem 

2 

L[y]=(t2-2t)^  +5(t-l)^  +3y=0;  >-(l)  =  7,  /(1)  =  3.  (15) 

Solution .  Since  the  initial  conditions  are  given  at  /=  1,  we  will  express  the 
coefficients  of  the  differential  equation  (15)  as  polynomials  in  (/- 1),  and 
then  we  will  findy(0  as  a  power  series  centered  about  t=  1.  To  this  end, 
observe  that 

r2-2r  =  r(r-2)=[(r-l)  +  l][(r-l)-l]  =  (r-l)2-l. 

Hence,  the  differential  equation  (15)  can  be  written  in  the  form 
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Setting  y  (t)  =  2  7=o an(l~  0".  compute 


L[>;](;)=[(;“1)2-l]  2  n{n-\)an(t-\) 


n  —  2 


n  =  0 


oo 


00 


+  5(t-l)  2  «»,,(*- 1)"_1  +  3  2  *„('-!)" 

/i  =  0  /?  =  0 

■~2  w(«  —  l)a„(/—  l)”-2 
«  =  0 

00  00 

+  2  «(«“  !)«„<>- 0"  +  2  (5«  +  —  1)" 

«=0  H=0 

=  -  2  («  +  2)(rt  +  l)an+2(?-l)"  +  2  (n2  +  4«  +  3)a„(t 

n  =  0  n  =  0 


-i)n. 


Setting  the  sums  of  the  coefficients  of  like  powers  of  t  equal  to  zero  gives 
—  (>i  +  2)(rt  +  \)an+2  +  (nl  +  ^n  +  3)^  =  0,  so  that 


n2  +  4«  +  3  «  +  3  .  A 

- - — - -  a„  = - -  a„,  n>  0. 

(/i  +  2)(*+l)  "  /i  +  2 


(16) 


To  satisfy  the  initial  conditions,  we  set  a0  =  l  and  ^  =  3.  Then,  from  (16), 

_  3 
a2  2 1 

4 

a3=3« 

and  so  on.  Proceeding  inductively,  we  find  that 

3-5 —  (2n+  1)  4-6  -  •  •  (2n  +  2) 

a2n=-n, — and  «2n+i  =  — - .  ^-3  (forn>l). 


II 

to|u> 

• 

II 

-M  ^ 

£ 

to 

II 

5-3  , 
4-2 -7, 

7 

«6=  ga4  = 

7-5-3  - 
6-4-2 

=  —  3 

3 

6 

«5=  5«3  = 

6-4  , 

sT3’ 

8 

a7~  7a5  — 

8*6*4  ~ 
7-5-3 

2*4* • • (2 n) 


3*5*  •  •  (2n+  1) 


Hence, 

y(t)  =  l  +  3(t  —  1)2  +  y  *3(/  —  1)3+... 


00 


=  7  +  72 

n-  1 


3-5---(2n+l)(t-l) 
2V 


2ai 


+  3(/-1)  +  3S  3-5  -  •  •  (2n  +  1) 


2n  +  1 


Example  5.  Solve  the  initial-value  problem 

^[>’]=(i-0^7  +  ^+(i-0>’=0;  j(o)=i,  /(o)=i. 

Solution.  Setting _y(?)=  we  compute 
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00 

£[/](')  =  (!-')  2  n{n-\)antn~ 2 

n  =  0 

+  2  2  vn 

/i  =  0  n  —  0 


oo 


00 


=  2  «(«“  l)^*"  2~  2 

n= 0  w=0 


- 1 


00 


00 


00 


+  2  nant"- '+  2  V"“  2  antn+X 

n=0  n=0  n=0 


oo 


00 


=  2  ("  +  2)(«  +  l)a„+2?n-  2  n(n-2)a„tn 

n  —  0  aj  =  0 


—  1 


00 


00 


+  2  antn-  2  V"+I 

«  =  0  n  —  0 


oo 


00 


=  2  («  +  2)(«  +  l)an+2/n-  2  («+l)(#i-l)fl(f+l/" 

aj  =  0  n  —  0 


00 


00 


+  2  °n 2  On-l t" 
n=0  n = 1 

=  2^2  +  Ctx  +  CIq 
oo 

+  2  {(«  +  2)(«+1K+2-("  +  1)(«-0tf„+i  +  tfn-a„-i}'" 

1 

Setting  the  coefficients  of  each  power  of  t  equal  to  zero  gives 

(n+l)(n- \)an+y-a„  +  an_i 
and  - (»  +  2)(„+l) - '  n>'' 


a~>=  - 


ax  +  a0 


(17) 

To  satisfy  the  initial  conditions,  we  set  a0  =  1  and  ax  =  1.  Then,  from  (17), 

3a3  —  a2  +  ax  i 

=  6’ 


—  ax  +  a0 

a7=-  1,  a,= - r - =0, 


a4  = 


12 


as  = 


8  a4—a3  +  a2  j 


20 


60’ 


a6  = 


15  a5  —  a4  +  a3 
30 


1 


360 


and  so  on.  Unfortunately,  though,  we  cannot  discern  a  general  pattern  for 
the  coefficients  an  as  we  did  in  the  previous  examples.  (This  is  because  the 
coefficient  an+2  depends  on  the  values  of  a„+1,  an,  and  an_x,  while  in  our 
previous  examples,  the  coefficient  an+2  depended  on  only  one  of  its  prede¬ 
cessors.)  This  is  not  a  serious  problem,  though,  for  we  can  find  the 
coefficients  an  quite  easily  with  the  aid  of  a  digital  computer.  Sample  Pascal 
and  Fortran  programs  to  compute  the  coefficients  a2,...,an  in  terms  of  a0 
and  av  and  to  evaluate  the  “approximate”  solution 

y(t)  =  a0+ait+ ...  +antn 
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2  Second-order  linear  differential  equations 


at  any  point  t  are  given  below.  These  programs  have  variable  values  for  a0 
and  av  so  they  can  also  be  used  to  solve  the  more  general  initial- value 
problem 


(i-0 


d2y 
dt 2 


+  |+d-<)y=°; 


>'(0)  =  a0>  /(O)^ 01- 


Pascal  Program 


Program  Series  (input,  output); 
var 

A:  array [0 . .  199]  of  real; 

T,  sum:  real; 
k,  N:  integer; 

begin 

readln(A[0],  A[1],T,  N); 
page; 

A[2]  :=  — 0.5*  (A[1]  +  A[0]); 
sum  :=A[0]  +  A[1]  *T+A[2]  *T  *T; 
for  k:=1  to  N  — 2  do 
begin 

A[k  +  2]:=((k+1)*(k-1)*A[k  +  1]-A[k]  +  A[k-1]) 
/((k  +  1)*(k  +  2)); 

sum  :=sum  +  A[k  +  2]  *exp((k  +  2)*  In  (T)); 
end; 

writeln(‘For  N  =  ’  ,N:3,  ’andT=’  ,T:6:4); 
writeln(‘the sum  is:  \sum:11:9); 
end. 


Fortran  Program 


10 


1 


20 
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DIMENSION  A(200) 

READ  (5,  10)  A0,  A(1),  T,  N 
FORMAT  (3F1 5.8,  15) 

A(2)  =  -  0.5  *  (A(1 )  +  A0) 

A(3)  =  (A0  —  A(1  ))/2.  *  3. 

SUM  =  A0  +  A(1) * T  +  A(2) * T *  *2  +  A(3)*T*  *3 
NA=N— 2 
DO 20  K  =  2,  NA 

A(K  +  2)  =  (A(K  —  1 )  -  A(K)  +  (K  + 1 .)  *  (K  - 1 .)  * 

A(K  + 1  ))/(K  + 1 .)  *  (K  +  2.) 

SUM  =  SUM  +  A(K  +  2) * T *  *(K  +  2) 

CONTINUE 

WRITE  (6,30)  N,T,  SUM 

FORMAT  (1  HI ,  ‘FOR  N  =  ‘,  13,  ‘,  AND  T  =  ’,  FI  0.4/1  H,  THE 
SUM  IS’,  F20.9) 

CALL  EXIT 
END 


2.8  Series  solutions 


See  also  C  Program  14  in  Appendix  C  for  a  sample  C  program. 

Setting  A\_ 0]  =  1,  A[l]  =  l,  (A(  1)=1  for  the  Fortran  program),  T= 0.5,  and 
N  =  20  in  these  programs  gives 

+  tfi(|)  +  ...  +a2  o(^)  =  1.26104174. 

This  result  is  correct  to  eight  significant  decimal  places,  since  any  larger 
value  of  N  yields  the  same  result. 


Exercises 

Find  the  general  solution  of  each  of  the  following  equations. 

1.  y"  +  ty'+y  =  0  2.  y"-ty  =  0 

3.  (2  +  t2)y"  —  ty'  —  3y  =  0  4.y"-t3y  =  0 

Solve  each  of  the  following  initial-value  problems. 

5.  t(2-t)y”-6(t-\)y'-4y=0;  y(l)  =  1,  /(1)  =  0 

6.  y "  +  t2y  =  0;  y  (0)  =  2,  y\ 0)  =  -  1 

7.  y"  —  t3y  =  0;  y(0)  =  0,  /(0)=  -2 

8.  y"  +  (/2  +  2/+  l)y'  — (4  +  4/)y  =  0;  y(-l)  =  0,  /(-1)=1 

9.  The  equation  y"  —  2/y'+Ay =0,  X  constant,  is  known  as  the  Hermite  differential 
equation,  and  it  appears  in  many  areas  of  mathematics  and  physics. 

(a)  Find  2  linearly  independent  solutions  of  the  Hermite  equation. 

(b)  Show  that  the  Hermite  equation  has  a  polynomial  solution  of  degree  n  if 
X  —  2n.  This  polynomial,  when  properly  normalized;  that  is,  when  multi¬ 
plied  by  a  suitable  constant,  is  known  as  the  Hermite  polynomial  Hn(t), 

10.  The  equation  (1  -  t2)y"  —  2(y'  +  a(a  +  \)y  =  0,  a  constant,  is  known  as  the 
Legendre  differential  equation,  and  it  appears  in  many  areas  of  mathematics 
and  physics. 

(a)  Find  2  linearly  independent  solutions  of  the  Legendre  equation. 

(b)  Show  that  the  Legendre  differential  equation  has  a  polynomial  solution  of 
degree  n  if  a  =  n. 

(c)  The  Legendre  polynomial  Pn{t)  is  defined  as  the  polynomial  solution  of  the 
Legendre  equation  with  a  =  n  which  satisfies  the  condition  P„(l)=  1.  Find 
Po(0,  P i(0>  ^2(0>  and  7*3(0* 

11.  The  equation  (1  —  t2)y"  —  ty'  +  a2y =0,  a  constant,  is  known  as  the  Tchebycheff 
differential  equation,  and  it  appears  in  many  areas  of  mathematics  and  physics. 

(a)  Find  2  linearly  independent  solutions  of  the  Tchebycheff  equation. 

(b)  Show  that  the  Tchebycheff  equation  has  a  polynomial  solution  of  degree  n 
if  a  =  n.  These  polynomials,  when  properly  normalized,  are  called  the 
Tchebycheff  polynomials. 
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12.  (a)  Find  2  linearly  independent  solutions  of 

y"  +  t3y'  +  3t2y  =0. 

(b)  Find  the  first  5  terms  in  the  Taylor  series  expansion  about  /  =  0  of  the  solu¬ 
tion  y(t)  of  the  initial-value  problem 

y"  +  t3y'  +  3t2y  =  e‘;  y(0)  =  0,  j>'(0)  =  0. 

In  each  of  Problems  13-17,  (a)  Find  the  first  5  terms  in  the  Taylor  series 

expansion  ^j™=0antn  of  the  solution  y(t)  of  the  given  initial- value  prob¬ 
lem.  (b)  Write  a  computer  program  to  find  the  first  N  + 1  coefficients 
a0,au...,aN,  and  to  evaluate  the  polynomial  a0  +  axt+  ...  +aNtN.  Then, 

obtain  an  approximation  of  y(\)  by  evaluating  2  n_0an(2)n- 

13.  (l-t)y"  +  ty'+y  =  0-,  y(0)=l,/(0)  =  0 

14.  y"  +y'  +  ty=0;  y(0)=  -  1,  /(0)  =  2 

15. y"  +  ty’  +  e,y=0;  y(0)=  1,  /(0)  =  0 

16.  y"  +y'  +  e‘y=0;  y(0)  =  0,  /(0)  =  - 1 

17. y"+/  +  e-'y=0;  y(0)  =  3,  /(0)  =  5 

2.8.1  Singular  points,  Euler  equations 
The  differential  equation 

L[y]  =  P(t)^  +  Q(t)^  +  R(t)y  =  0  (1) 

is  said  to  be  singular  at  t  =  t0  if  P(^0)=:0.  Solutions  of  (1)  frequently 
become  very  large,  or  oscillate  very  rapidly,  in  a  neighborhood  of  the 
singular  point  /0.  Thus,  solutions  of  (1)  may  not  even  be  continuous,  let 
along  analytic  at  f0,  and  the  method  of  power  series  solution  will  fail  to 
work,  in  general. 

Our  goal  is  to  find  a  class  of  singular  equations  which  we  can  solve  for  t 
near  f0.  To  this  end  we  will  first  study  a  very  simple  equation,  known  as 
Euler’s  equation,  which  is  singular,  but  easily  solvable.  We  will  then  use  the 
Euler  equation  to  motivate  a  more  general  class  of  singular  equations  which 
are  also  solvable  in  the  vicinity  of  the  singular  point. 

Definition.  The  differential  equation 

L[y]  =  t^  +  atjj  +  /3y  =  0.  (2) 

where  a  and  j3  are  constants  is  known  as  Euler’s  equation. 
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We  will  assume  at  first,  for  simplicity,  that  t  >  0.  Observe  that  t2y"  and 
ty'  are  both  multiples  of  tr  if  y  —  tr.  This  suggests  that  we  try  y  =  tr  as  a 
solution  of  (2).  Computing 

-7 -tr=rtr'x  and  tr  =  r(r  —  \)tr~2 
dt  dt2 

we  see  that 

L[tr]  —  r{r  —  \)tr  +  artr  +  fltr 
=  [r(r  —  1 )  +  ar  +  (i]tr 

=  F(r  )tr  (3) 

where 

F(r)  =  r(r  —  \  )+  ar  +  fi 

=  r2  +  (a  —  l)r  +  /?.  (4) 

Hence,  y  =  tr  is  a  solution  of  (2)  if,  and  only  if,  r  is  a  solution  of  the 
quadratic  equation 

r2  +  (a  —  l)r  +  ft  =  0.  (5) 

The  solutions  r,,  r2  of  (5)  are 

ri  =  ~2  (a-  l)  +  /(a-  l)2-4^ 

'2=_2  (a-l)-/(a-l)2-4/J  . 

Just  as  in  the  case  of  constant  coefficients,  we  must  examine  separately  the 
cases  where  (a  -  l)2  -4/?  is  positive,  negative,  and  zero. 

Case  1.  (a  —  l)2  —4/3  ^  0.  In  this  case  Equation  (5)  has  two  real,  unequal 
roots,  and  thus  (2)  has  two  solutions  of  the  form  y\{t)  —  tr\y2{t)  —  tr2. 
Clearly,  tK]  and  trj  are  independent  if  r,  ^r2.  Thus  the  general  solution  of 
(2)  is  (for  t  >  0) 

y(t)  =  c,C'  -f  c2tri. 

Example  1.  Find  the  general  solution  of 

L[y]  =  t2~^j  +4t^dt  =  (6) 

Solution.  Substituting^  =  tr  in  (6)  gives 

L[tr\  =  r{r  -\)tr  +  4rtr  +  2tr 
=  [r(r  —  l)  +  4r  +  2]tr 
=  (r2  +3r  +2)tr 
=  (r  +  l)(r+2)tr 
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Hence  rx  —  —  1,  r2  —  —  2  and 

y(t)  =  clr'  +  c2r*  =  ?j-  +  % 

t  t 

is  the  general  solution  of  (6). 

Case  2.  (a  —  l)2  —4/?  =  0.  In  this  case 

1  —  a 

rl  =  r2  =  — 

and  we  have  only  one  solution y  —  tr 1  of  (2).  A  second  solution  (see  Exercise 
11)  can  be  found  by  the  method  of  reduction  of  order.  However,  we  would 
like  to  present  here  an  alternate  method  of  obtaining  which  will  generalize 
very  nicely  in  Section  2.8.3.  Observe  that  F(r)  —  (r  —  rx)2  in  the  case  of 
equal  roots.  Hence 

L[r]  =  (r-r,)V.  (7) 

Taking  partial  derivatives  of  both  sides  of  (7)  with  respect  to  r  gives 

Since  d(tr)/dr  =  tr\n  t ,  we  see  that 

L[/Tn  t]  =  (r  —  rx)2trln  t  +2(r  —  rx)tr.  (8) 

The  right  hand  side  of  (8)  vanishes  when  r  —  rx.  Hence, 

L[/r'lnf]=0 

which  implies  that  y2(t)  =  tr'\nt  is  a  second  solution  of  (2).  Since  tr 1  and 
tr'\nt  are  obviously  linearly  independent,  the  general  solution  of  (2)  in  the 
case  of  equal  roots  is 

y(t)  =  (q  +  c2ln  t)tr,  t>  0. 

Example  2.  Find  the  general  solution  of 

LM='2^r-5t%+9y=0’  t>0 •  (9) 

Solution.  Substituting  v  =  tr  in  (9)  gives 

L[tr]=  r(r  -l)tr -5rtr  +9tr 
=  [r(r-l)-5r+9]f 
=  (r2—6r  +9  )tr 
=  {r-3)2tr. 


200 


2.8  Series  solutions 


The  equation  (r  —  3)2  =  0  has  r  —  3  as  a  double  root.  Hence, 

and  the  general  solution  of  (9)  is 

y{t)  —  (cj  +  c2ln  t)t3 ,  t  >  0. 


Case  3.  (a  —  l)2  —  4)8  <  0.  In  this  case. 


r}  =  X  +  //a  and  r2  =  A  —  ifi 


with 


_l-«  _[4p-(a-l)2] 


1/2 


2  2 

are  complex  roots.  Hence, 

<j>(t)  =  =  txtitL 

=  t x  [cos(  /a  In  f )  + '/  sin(  /x  In  / )] 

is  a  complex- valued  solution  of  (2).  But  then  (see  Section  2.2.1) 


and 


y^t)  =Re{<J>(/)}  =  rxcos(/xln /) 
y2(/)  =  Im{<J>(*)}  =  txsin({x\nt) 


are  two  real-valued  independent  solutions  of  (2).  Hence,  the  general  solution 
of  (2),  in  the  case  of  complex  roots,  is 

y(t )  =  ^x[c1cos(/xln  t)  +  c2sin(juln  *)] 

with  X  and  [i  given  by  (10). 


Example  3.  Find  the  general  solution  of  the  equation 

L[y]  =  t*y"  —  5ty'  +  25y  =  0,  t>  0.  (11) 

Solution.  Substituting  v  =  tr  in  (11)  gives 

L[tr\  =  r(r-\)t'-5rtr+25t' 

=  [/•(/•  — l)  —  5r+ 25]  tr 
=  [r2  —6r  +  25]rr 
The  roots  of  the  equation  r2  —  6r  +25  =  0  are 

6±/36-100 
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so  that 

+  (/)  =  ,3  +  4i  =  r3/4i 

_  ^3e(ln/)4/  _  1 3^/(4 In/) 

=  f3[cos(41n  t)  +  /sin(41n  f)] 


is  a  complex- valued  solution  of  (11).  Consequently, 


and 


yx(t )  =  Re{<J>(f)}  =  r3cos(41nf) 


y2(t)  =  Im{4>(/)}  =  f3sin(41nf) 


are  two  independent  solutions  of  (11),  and  the  general  solution  is 

y(t )  =  f3[c1cos(41n/)+  c2sin(41n/)]  ,  t>  0. 

Let  us  now  return  to  the  case  t  <  0.  One  difficulty  is  that  tr  may  not  be 
defined  if  t  is  negative.  For  example,  (—  l)1/2  equals  /,  which  is  imaginary.  A 
second  difficulty  is  that  In  t  is  not  defined  for  negative  t.  We  overcome  both 
of  these  difficulties  with  the  following  clever  change  of  variable.  Set 

t  =  —  x,  x  >  0, 

and  let  y  —  u{x\  x  >  0.  Observe,  from  the  chain  rule,  that 


and 


dy  _  du  dx  _  _du 
dt  dx  dt  dx 

d2y  _  d_l  _du  \  _  d  (  du\dx  _  d2u 
dt2  dt  \  dx  J  dx\  dx  J  dt  dx2 


Thus,  we  can  rewrite  (2)  in  the  form 


+  pu  —  0 


or 


+  ax 


du 

dx 


T*  /?w  —  0, 


x  >  0 


But  Equation  (12)  is  exactly  the  same  as  (2)  with  t  replaced  by  x  and  y 
replaced  by  u.  Hence,  Equation  (12)  has  solutions  of  the  form 


u 


cxxr 1  +  c2xr2 
-  ( Cj  +  c2ln x)xr‘ 

[c1cos(filn  x)  4-  c2sin(juln  x)]  xx 
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depending  on  whether  (a— l)2— 4)8  is  positive,  zero,  or  negative.  Observe 
now  that 

x  —  —  t  —  \t\ 

for  negative  t .  Thus,  for  negative  t ,  the  solutions  of  (2)  have  one  of  the  forms 

c1|/|''  +  c2|/|'* 

-  [c,  -t-  c2ln|/|  ]  |/| r' 

[c,cos(juln | / 1 )  +  c2sin(juln|f  |)]  |<|x 


Remark.  The  equation 

is  also  an  Euler  equation,  with  a  singularity  at  t  —  tQ  instead  of  t  —  0.  In  this 
case  we  look  for  solutions  of  the  form  ( t  —  t0)r .  Alternately,  we  can  reduce 
(14)  to  (2)  by  the  change  of  variable  x  —  t  — 10. 


Exercises 


In  Problems  1-8,  find  the  general  solution  of  the  given  equation. 

1.  t2y"  +  Sty'  —  5y  =  0  2.  2t2y"  +  3ty'  —  y  =  0 


3.  (t-\)2y"-2(t-\)y'  +  2y  =  0 
5.  t2y"  —  ty' +  y  =  0 
7.  t2y”+ty'+y  =  0 

9.  Solve  the  initial-value  problem 


4.  t2y''  +  3ty'+y  =  0 

6.  (/  — 2)V'  +  5(/  — 2)/  +  4j;  =  0 

8.  t2y"  +  3ty'  +  2y  =  0 


fy'-f/-2y  =  0;  H0  =  0,  /(1)  =  1 
on  the  interval  0  <  t  <  oo . 

10.  Solve  the  initial-value  problem 

t2y"-3ty'+4y  =  Q ;  ^(1)  =  1,  /(1)  =  0 

on  the  interval  0  <  t  <  oo . 

11.  Use  the  method  of  reduction  of  order  to  show  thaty2(/)  =  /''•In  t  in  the  case  of 
equal  roots. 

2.8.2  Regular  singular  points ,  the  method  of  Frobenius 

Our  goal  now  is  to  find  a  class  of  singular  differential  equations  which  is 

more  general  than  the  Euler  equation 


ldly+«i%  +0y  =  O 


dt 


(1) 
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but  which  is  also  solvable  by  analytical  techniques.  To  this  end  we  rewrite 
(1)  in  the  form 


di,+«<t+iy=() 


dt 


t  dt 


A  very  natural  generalization  of  (2)  is  the  equation 

L[y]  =  ^j  +  p(t)^  +  q(t)y  =  o 

where  p(t )  and  q(t )  can  be  expanded  in  series  of  the  form 

p(0  =  y  +  P\  +  Pit  +  P3t2+  ••• 

4(0  =  ^  +  7-  +  42  +  <73?  +  <Ja  (2  +  •  •  • 
t  1 


(2) 

(3) 

(4) 


Definition.  The  equation  (3)  is  said  to  have  a  regular  singular  point  at  t  —  0 
if  p(t)  and  q(t)  have  series  expansions  of  the  form  (4).  Equivalently,  t  =  0 
is  a  regular  singular  point  of  (3)  if  the  functions  tp(t)  and  t2q(t)  are 
analytic  at  t  =  0.  Equation  (3)  is  said  to  have  a  regular  singular  point  at 
t  =  t0if  the  functions  (t  -  t0)p(t)  and  ( t  -  t0)2q(t)  are  analytic  at  t  =  tQ. 
A  singular  point  of  (3)  which  is  not  regular  is  called  irregular. 


Example  1.  Classify  the  singular  points  of  Bessel’s  equation  of  order  v 


where  v  is  a  constant. 

Solution.  Here  P(t)  =  t 2  vanishes  at  t  =  0.  Hence,  t  —  0  is  the  only  singular 
point  of  (5).  Dividing  both  sides  of  (5)  by  t 2  gives 


d2y  1  dy 
dt 2  t  dt 


y  =  o. 


Observe  that 


tp(t)  =  l  and  t2q(t)  —  t2  —  v2 

are  both  analytic  at  /  =  0.  Hence  Bessel’s  equation  of  order  v  has  a  regular 
singular  point  at  t  =  0. 


Example  2.  Classify  the  singular  points  of  the  Legendre  equation 

(l-t2)^J-2t^  +  a(oc+l)y  =  0  (6) 

where  a  is  a  constant. 

Solution.  Since  1  —  t2  vanishes  when  /  =  1  and  —1,  we  see  that  (6)  is 
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singular  at  t  —  ±  1.  Dividing  both  sides  of  (6)  by  1  —  t2  gives 

d2y  It  dy  ,  (a  +  1)  „ 

-  +a- - T-y  —  0. 


dt2  l-t2  dt  \-t 


Observe  that 


It 


It 


and 


1  -t2  1  +  * 

2 


(f- 1  )2q(  t  )  =  «(«+ 1 )  =  «(«+ 1 )  777 

are  analytic  at  t  —  1.  Similarly,  both  ( t  +  and  ( t  +  l)2q(t)  are  analytic 
at  t  —  —  1.  Hence,  t  =  1  and  t  =  —  \  are  regular  singular  points  of  (6). 


Example  3.  Show  that  /  =  0  is  an  irregular  singular  point  of  the  equation 


t 


2  d  y  , 


“h  3  — - — h  t y  —  0 . 
dt 2  dt  7 


(7) 


Solution.  Dividing  through  by  t2  gives 

d2y  ,  3  dy  ,  1  _ 


In  this  case,  the  function 


,  +  --*-  +  -y  =  0. 
dt 2  t2  dt  ty 


,  v  /  3  \  3 

#(0  =  M  —  =T 
\  r  /  ^ 


is  not  analytic  at  t  —  0.  Hence  t  =  0  is  an  irregular  singular  point  of  (7). 


We  return  now  to  the  equation 

L[y]  =  ^;  +  p(t)^  +  <i(t)y  =  Q  (8) 

where  t  —  0  is  a  regular  singular  point.  For  simplicity,  we  will  restrict 
ourselves  to  the  interval  />0.  Multiplying  (8)  through  by  t 2  gives  the 
equivalent  equation 

L[y]  =  t2^;  +  t{tp(t))^  +  t2q(t)y  =  0.  (9) 

We  can  view  Equation  (9)  as  being  obtained  from  (1)  by  adding  higher 
powers  of  t  to  the  coefficients  a  and  13.  This  suggests  that  we  might  be  able 
to  obtain  solutions  of  (9)  by  adding  terms  of  the  form  /r+1,  tr+ 2,...  to  the 
solutions  tr  of  (1).  Specifically,  we  will  try  to  obtain  solutions  of  (9)  of  the 
form 

00  00 

y{t)=  2  antn+r  =  f  2  ant\ 

n— 0  n = 0 
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Example  4.  Find  two  linearly  independent  solutions  of  the  equation 


L[y)  =  lt 


_  ~,<d2y  ,  dy 


,  +^  +  ty  =  0, 

dt2  dt 


0  <  t  <  oo. 


Solution.  Let 

00 

y(t)=  2  antn+r,  a0^0. 

«  =  0 

Computing 

00 

y'{t)=  2  (n  +  r)ant"+r~' 

n  =  0 

and 

00 

y"(0~  2  ^  —  2 

«  =  o 

we  see  that 


00 


2  2  («  +  r)(n  +  /--l)a„*n 


-1 


oo 


00 


+  2  ( n  +  r)aHt--'+  2  antn+x 

n  —  0  n  =  0 


00 


2  2  {n  +  r){n  +  r  -\)antn  1 
L  n  =  0 


00  00 

+  2  (n  +  r)antn~'+  2  an-2tn~l 

n  —  0  n  =  2 

—  [2r(r  —  1  )a0  +  ra0]tr~l  H-  [2(1  -h  r)rax  +(1  +  r)ax] tr 
00 

+  2  [2(n  +  r)(n  +  r-l)an  +  (n  +  r)an  +  a„_2]tn+r-1 

n  =  2 


Setting  the  coefficients  of  each  power  of  t  equal  to  zero  gives 

(i)  2 r(r  —  1  )a0  +  ra0  =  r(2r  —  1  )a0  —  0, 

(ii)  2(r  +  1  )rax  +  (r  + 1)^  =  (r  +  l)(2r  + 1  )ax  =  0, 
and 

(iii)  2(/i  +  r)(w  -hr  -  l)a„  +  («  +  =  (n  +  r)[2(w  +  r)~ \]an  =  -  an_2, 


The  first  equation  determines  r;  it  implies  that  r  =  0  or  r  —  \.  The  second 
equation  then  forces  ax  to  be  zero,  and  the  third  equation  determines  an  for 


n 


z,. 


206 


2.8  Series  solutions 


(i)  r  —  0.  In  this  case,  the  recurrence  formula  (iii)  is 

—  a  o 

n  —  2 


<*n  = 


n(2n  —  1)  ’ 


2. 


Since  ax  —  0,  we  see  that  all  of  the  odd  coefficients  are  zero.  The  even 
coefficients  are  determined  from  the  relations 


a2  = 


a0  _  a2 
2-3  ’  4-7 


a 


o 


a , 


—  a 


2-4*3*7  ’ 

and  so  on.  Setting  a0  =  1,  we  see  that 


a <  = 


o 


6-11  2-4-6-3-7-11 


t 


Ti(0  =  1_TT  + 


t 


00 


2-3  2-4-3-7 


+ 


=i+  2 


(-in 


”  +  2  n 


nZx  2nn\3-l  •  •  •  (4n  —  1) 


is  one  solution  of  (10).  It  is  easily  verified,  using  the  Cauchy  ratio  test,  that 
this  series  converges  for  all  t. 


(ii)  r  =  \.  In  this  case,  the  recurrence  formula  (iii)  is 

—  Cl  o  —  (2  o 

_  n  —  2  _  n~2 


a 


n 


(«  +  ^)[2(n  +  ^)-l]  n(2n  + 1)  ’ 


n>2. 


Again,  all  of  the  odd  coefficients  are  zero.  The  even  coefficients  are 
determined  from  the  relations 


a 


o 


_  -a2 

61 4  4*9 


a 


0 


a 


2*4*5*9  ’ 

and  so  on.  Setting  a0  =  1,  we  see  that 


a*  = 


4  _ 


—  a 


o 


6-13  2-4-6-5-9-13 


=  ,1/2 


1- 


t 


+ 


f 


2-5  2-4-5-9 


+ 


00 


i+  2 


(-in 


nu2n 


n  ==  1 


2nn\5-9  •  •  •  (4«  + 1) 


is  a  second  solution  of  (10)  on  the  interval  0  <  t  <  oo. 


Remark.  Multiplying  both  sides  of  (10)  by  /  gives 

2, ^  +  4  +  ^  =  0. 

dt2  dt  7 

This  equation  can  be  viewed  as  a  generalization  of  the  Euler  equation 

2,2^  + 4  =  0. 

dt 2  dt 


(11) 


Equation  (11)  has  solutions  of  the  form  tr ,  where 

2  r(r  —  1)  +  r  =  0. 

This  equation  is  equivalent  to  Equation  (i)  which  determined  r  for  the 
solutions  of  (10). 
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Let  us  now  see  whether  our  technique,  which  is  known  as  the  method  of 
Frobenius,  works  in  general  for  Equation  (9).  (We  will  assume  throughout 
this  section  that  t  >  0.)  By  assumption,  this  equation  can  be  written  in  the 
form 


L\y]  =t2~^;  +  t[Po  +  P\l  +  P2{2+  •  ••  ]^  +  [4o  +  <7i*  +  42?2+  •  ••  ]y-0. 

Set 

00 

y(t)=  2  antn+r,mthao¥=0. 

n  =  0 

Computing 

00 

/(')=  2  ( n  +  r)a„tn+r 

n  =  0 

and 

00 

y"(t)—  2  (n  +  r)(w  + r  ~ \)antn+r~2 

n  =  0 

we  see  that 


L[y]  =  T 


00 


2  («  +  r)(w  +  r  —  l)tfn/n  + 

n  =  0 


'  00  ^ 

00 

2  p„tm 

2  ( n  +  r)antn 

\  m  —  0  > 

.  n  =  0 

1  00  \ 

r  00  \  ' 

+ 

2 

2  v"  * 

\  m  —  0  / 

l/i  =  0  /  ^ 

Multiplying  through  and  collecting  terms  gives 

L[y]  =  [r(r-l)  +  p0r  +  q0]a0tr 

+  { [( 1  +  r  )r  +  PoC 1  +  r  )  +  ?0]  a  i  +  (. rp  ,  +  q  , ), a0 }  t r+ 1 


+  ■  [(n  +  r){n  +  r-\)  +  p0(n  +  r)  +  q0]an 

n  —  1 

+  2  [(k  +  r)pn_k  +  q„^k]ak 

k  =  0 

+  •  •  •  . 

This  expression  can  be  simplified  if  we  set 

F(r)  =  r{r-l)  +  p0r  +  q0.  (12) 
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Then, 


L[y]  =  aoF{r)tr  +[axF(\  +  r)  +  {rp\  +  qx)a<]t'+r  + 

( 

+  anF(n  +  r)tn+r  + 


2  [(k  +  r)pn-k  +  <ln-k]aj 
k  =  0 


U 


n  +  r 


+ 


Setting  the  coefficient  of  each  power  of  t  equal  to  zero  gives 


F(r)  =  f(r- 1)  +  p0r  +  q0  =  0 


(13) 


and 


n  —  1 


F(n  +  r)an  =  -  2  [(k  + r)pn_k  + qn_k]ak,  n^l.  (14) 

k  =  0 

Equation  (13)  is  called  the  indicial  equation  of  (9).  It  is  a  quadratic 
equation  in  r,  and  its  roots  determine  the  two  possible  values  rx  and  r2  of  r 
for  which  there  may  be  solutions  of  (9)  of  the  form 

00 

2  °ntn+r ■ 

n  —  0 

Note  that  the  indicial  equation  (13)  is  exactly  the  equation  we  would  obtain 
in  looking  for  solutions  tr  of  the  Euler  equation 

2d2y  *  dy  . 
f  — -+Pot~£+%y-()- 


dt 


Equation  (14)  shows  that,  in  general,  an  depends  on  r  and  all  the  preceding 
coefficients  a0,  ax,...,an_x.  We  can  solve  it  recursively  for  an  provided  that 
F(1  +  r),  F(2+  r), . . . ,F(n  -I-  r )  are  not  zero.  Observe  though  that  if  F(n  +  r ) 
=  0  for  some  positive  integer  «,  then  n  +  r  is  a  root  of  the  indicial  equation 
(13).  Consequently,  if  (13)  has  two  real  roots  rX9  r2  with  rx  >r2  and  rx  —  r2  not 
an  integer,  then  Equation  (9)  has  two  solutions  of  the  form 

00  00 
y\{t)  =  tr'  2  an(r\)tn^y2(t)  =  t''2  2  a„(r2)tn, 

n  —  0  n  —  0 

and  these  solutions  can  be  shown  to  converge  wherever  tp(t )  and  t2q(t) 
both  converge. 


Remark.  We  have  introduced  the  notation  an(rx)  and  an(r2 )  to  emphasize 
that  an  is  determined  after  we  choose  r  —  rx  or  r2. 

Example  5.  Find  the  general  solution  of  the  equation 


L^i=4'$+3f+^=o- 
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Solution.  Equation  (15)  has  a  regular  singular  point  at  t  =  0  since 

tp(t)  =  1  and  t2q{t)—\t 
are  both  analytic  at  t  —  0.  Set 

00 

X0=  2  <V"+r,  a0^°- 

w  =  0 

Computing 

00 

y\t)=  2  (n  +  r )antn+r~l 

n  =  0 

and 

00 

2  {n  +  r)(n  +  r  -  \)antn+r~2 

n  =  o 

we  see  that 


00 


L[y]  =  4  2  ( n  +  r)(n  +  r-l)a„t”+r 
>2  =  0 

00  00 

+  3  2  («  +  /•)a„2"+r_1  +  3  2  antn+r 

n  —  0  n  —  0 

oo 

=  2  [4(«  +  r)(«  +  r  —  \)  +  3(n  +  r)]antn+r~l  +  2  3an_xtn 


00 


+  r-l 


>2  =  0 


>2  =  1 


Setting  the  sum  of  coefficients  of  like  powers  of  t  equal  to  zero  gives 

4  r(r  —  l)  +  3r  =  4r2  —  r  —  r(4r  —  1)  =  0  (16) 

and 

[4(n  +  r)(n  +  r  — l)  +  3(«  +  r)]an  =  (n  +  r)[4(n  +  r)  —  l]an  =  —3  an_l9 

n>  1.  (17) 

Equation  (16)  is  the  indicial  equation,  and  it  implies  that  r  —  0  or  r  =  }. 
Since  these  roots  do  not  differ  by  an  integer,  we  can  find  two  solutions  of 
(15)  of  the  form 

00 

2  a„t"+r 

n  =  0 

with  an  determined  from  (17). 

r  =  0.  In  this  case  the  recurrence  relation  (17)  reduces  to 

a  — 3 _ an—  1 _ __  ^an—  1 

n  4n(n~\)  +  3n  n(4n  —  l) 
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Setting  a0  —  1  gives 


and,  in  general, 


Hence, 


-  i  -  -3a>  1 

ax  —  —  1,  ci  2  —  —  3 


—  3  a 


a?>  3*11 

—  3a 


-  =  —32 


2-7  “  2-7  ’ 

1 


4-15 


3  _  o3 


2-3*7* 11  ’ 

1 

2*3*4-7*11*15  ’ 


a 


(  —  1)3 


n  on  —  1 


"  « !7  •  1 1  - 15  *  -  -  (4n  —  1) 


00 


>,i(0=  2 


(  —  1)  3 


n  on  —  1 


72  =  0 


«!7- 11  •  15  •  •  •  (4n  —  1) 


tn 


is  one  solution  of  (15).  It  is  easily  seen,  using  the  Cauchy  ratio  test,  that 
yx(t)  converges  for  all  t.  Hence yx(t)  is  an  analytic  solution  of  (15). 

r  —  \.  In  this  case  the  recurrence  relation  (17)  reduces  to 

_ _ 1 _ _  3an—  1  >  1 

(»+i)[4(»-t)+3]~»(4H  +  i)’  - 

Setting  a0  =  1  gives 

_  -3  32  _  —  33 

a\~  5  .  fl2_2-5-9’  °3~  2-3-5-9-13  ’ 

34 

°4~  2-3-4-5-9- 13- 17  *  ”  ’  ' 


Proceeding  inductively,  we  see  that 


Hence, 


a 


(— 1)”3” 


n 


n  !5*9*  13  •  •  •  (4 n  + 1) 


00  / _ i\w  on 

y2(t)  =  tl/4  2  ' ,,  q  n - 

AI  =  0  A2I5-9-13  -  •  •  (4 n  -1-1) 


is  a  second  solution  of  (15).  It  can  easily  be  shown,  using  the  Cauchy  ratio 
test,  that  this  solution  converges  for  all  positive  t.  Note,  however,  that  y2(t) 
is  not  differentiable  at  t  —  0. 

The  method  of  Frobenius  hits  a  snag  in  two  separate  instances.  The  first 
instance  occurs  when  the  indicial  equation  (13)  has  equal  roots  rx  —  r2.  In 
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this  case  we  can  only  find  one  solution  of  (9)  of  the  form 

00 

y\(t)  =  tr'  2  antn . 

n  =  0 

In  the  next  section  we  will  prove  that  (9)  has  a  second  solution  y2(t)  of  the 
form 

00 

y2(t)  =  +  2  b„tn 

n  =  0 

and  show  how  to  compute  the  coefficients  bn.  The  computation  of  the  bn  is 
usually  a  very  formidable  problem.  We  wish  to  point  out  here,  though,  that 
in  many  physical  applications  the  solution  y2(t)  is  rejected  on  the  grounds 
that  it  is  singular.  Thus,  it  often  suffices  to  find  yx(t)  alone.  It  is  also 
possible  to  find  a  second  solution y2(t)  by  the  method  of  reduction  of  order, 
but  this  too  is  usually  very  cumbersome. 

The  second  snag  in  the  method  of  Frobenius  occurs  when  the  roots  rX9  r2 
of  the  indicial  equation  differ  by  a  positive  integer.  Suppose  that  rx  —  r2  +  N, 
where  N  is  a  positive  integer.  In  this  case,  we  can  find  one  solution  of  the 
form 

00 

yM  =  *r'  2 

n  =  0 

However,  it  may  not  be  possible  to  find  a  second  solution  y2(t)  of  the  form 

00 

y2{t)  =  tr*  2  b„tn. 

/!  =  0 

This  is  because  F(r2  +  n)  =  0  when  n  =  N.  Thus,  the  left  hand  side  of  (14) 
becomes 

N-  1 

0-aN  =  -  2  [{k  +  r2)pN_k  +  qN_^ak  (19) 

k  =  0 

when  n  —  N.  This  equation  cannot  be  satisfied  for  any  choice  of  aN ,  if 

N-  1 

2  [(k  +  r2)pN-k  +  qN-k]ak^0. 

k  =  0 

In  this  case  (see  Section  2.8.3),  Equation  (9)  has  a  second  solution  of  the 
form 

00 

yi{t)  =  y\{t)\nt  +  tr'  2  bJ" 

n  =  0 

where  again,  the  computation  of  the  bn  is  a  formidable  problem. 
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On  the  other  hand,  if  the  sum  on  the  right  hand  side  of  (19)  vanishes, 
then  aN  is  arbitrary,  and  we  can  obtain  a  second  solution  of  the  form 

00 

2  V" • 

n  =  0 

We  illustrate  this  situation  with  the  following  example. 


Example  6.  Find  two  solutions  of  Bessel’s  equation  of  order 

t2-—^r  +  t^j-  +  it2  —  1/4)  y  =  0,  0</<oo. 
dt 2  dt  v  7 

Solution.  This  equation  has  a  regular  singular  point  at  /  =  0  since 

(P(0  =  1  and  t2q(t)  —  t2  —  \ 
are  both  analytic  at  t  —  0.  Set 

00 

y(t)=  2  “ntn+r, 

n  =  0 

Computing 

00 

y'U)=  2  (n  +  r )antn+r~i 

n  =  0 

and 

00 

y"(t)=  2  {n~\~  r){n  +  r  —  \)antn+r~2 

n  =  0 

we  see  that 


00  00 
£[.y]=  2  («  +  r)(w  +  r~l)*Vw+r  +  2  (n  +  r)antn+r 

n  —  0  rc  =  0 

00  00 

+  2  antn+r+1-\  2  antn+r 

n  =  0  n  =  0 

00  00 

=  2  [{n  +  r){n  +  r  —  \)  +  (n  +  r)  —  \\antn+r  +  2  a„-itn+r ■ 

n= 0  n= 2 

Setting  the  sum  of  coefficients  of  like  powers  of  t  equal  to  zero  gives 

F(r)ao  -  ['•('■  -  1)  +  r  ~  i]  a0  =  (r2  -  $)a0  =  0  (i) 

i;’(l  +  r)ai  =  [(l  +  r)r  +  (l  +  r)-  i]a,  =  [(1  +  rf  -  |]a,  =  0  (ii) 

and 

F{n  +  r)an  =  [{n  +  rf -\]an  =  -a„_2,  n>  2  (iii) 

Equation  (i)  is  the  indicial  equation,  and  it  implies  that  rx  =  j,  r2  =  — 
rx  —  \\  Set  a0  =  \.  Equation  (ii)  forces  a,  to  be  zero,  and  the  recurrence 
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relation  (iii)  implies  that 


a„_ 


a 


n  —  2 


a 


n  —  2 


n 


2. 


F(n  +  n(n  +  1) ’ 

This,  in  turn,  implies  that  all  the  odd  coefficients  a3,  as,...,  are  zero,  and 
the  even  coefficients  are  given  by 

-1  _  1 
2-3“  3! 

~  a2  _  1  __  1 


a 


Cl  j  _ 

2  2-3 


o 


a4  ~ 


“6  = 


4-5  2-3-4-5 

-a4_  -1 


6-7  2-3-4-5-6-7 

and  so  on.  Proceeding  inductively,  we  see  that 


1 

7! 


flU  — 
In 


(-1)" 


(2h)!(2h  +  1) 


Hence 


>',(0  =  ^' 17  (l~  37+ 57- tT  + 


is  one  solution  of  (20).  This  solution  can  be  rewritten  in  the  form 


ji(0  = 


1/2 

/ 


t- 


iL  +  lL 

3!  5! 


7!  + 


1  . 
sin  t. 


ft 


r2  =  ~  2 :  Set  =  Since  1  +  r2  =  {  is  also  a  root  of  the  indicial  equation, 
we  could,  conceivably,  run  into  trouble  when  trying  to  solve  for  av 
However,  Equation  (ii)  is  automatically  satisfied,  regardless  of  the  value  of 
ci\.  We  will  set  ax  =  0.  (A  nonzero  value  of  ax  will  just  reproduce  a  multiple 
of  Ti(0)-  The  recurrence  relation  (iii)  becomes 


—  a 


an  = 


n  —  2 


—  a 


n  —  2 


a 


n  —  2 


("-2)' 


n2  -  n  «(«-!)’ 


n>  2. 


All  the  odd  coefficients  are  again  zero,  and  the  even  coefficients  are 


✓7  —  010  —  — 

2  2-1 

_  -  a2  _  1 

°4  ~  4-3  ~  4!" 


1 


2! 


a. 


a , 


6-5 


1 

6! 
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and  so  on.  Proceeding  inductively,  we  see  that 

(-D" 

(2«)!  ' 


Hence, 


flu  = 

2n 


(  /4  j.6 

>JW  =  rvJ(,-lT  +  if-i7  + 


COS  / 


is  a  second  solution  of  (20). 

Remark  1.  If  r  is  a  complex  root  of  the  indicial  equation,  then 


00 


y(t)  =  tr  2  ant* 

n  =  0 

is  a  complex-valued  solution  of  (9).  It  is  easily  verified  in  this  case  that  both 
the  real  and  imaginary  parts  of  y(t)  are  real- valued  solutions  of  (9). 


Remark  2.  We  must  set 

00 

y(0  =  l*r  2  antn 

n  =  0 

if  we  want  to  solve  (9)  on  an  interval  where  t  is  negative.  The  proof  is 
exactly  analogous  to  the  proof  for  the  Euler  equation  in  Section  2.8.1,  and  is 
left  as  an  exercise  for  the  reader. 

We  summarize  the  results  of  this  section  in  the  following  theorem. 

Theorem  8.  Consider  the  differential  equation  (9)  where  t  —  0  is  a  regular 
singular  point.  Then ,  the  functions  tp(t)  and  t2q(t)  are  analytic  at  t  =  0 
with  power  series  expansions 

tp(t)  =  Po  +  Pit  +  p2t2+  ■  ■  ■  ,t2q(t)  =  q0  +  q1t  +  q2t2+  ••• 

which  converge  for  \  t  \  <  p.  Let  r{  and  r2  be  the  two  roots  of  the  indicial 
equation 

r(r-l)+  por  +  qo  =  0 

with  rx  >  r2  if  they  are  real.  Then ,  Equation  (9)  has  two  linearly  independent 
solutions  yx{t)  and  y2(t)  on  the  interval  0  <t<p  of  the  following  form : 

(a)  If  rx  —  r2  is  not  a  positive  integer ,  then 

00  00 

yM  =  tr>  2  a„t\  y2(t)  =  s>  2  bntn. 

n  —  0  n  —  0 
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(b)  If  rx  —  r2,  then 

OO  00 

y\(t)  =  tr'  2  yi(t)  =  +  2  V"- 

n  =  0  «  =  0 

(c)  If  rx  —  r2  =  N,  a  positive  integer , 

00  00 
>>iO)  =  'r'  2  a„tn,  y2(t)  =  ayl(t)\nt  +  tr 2  2  V” 

«  =  0  n  =  0 

where  the  constant  a  may  turn  out  to  be  zero. 


Exercises 

In  each  of  Problems  1-6,  determine  whether  the  specified  value  of  t  is  a 
regular  singular  point  of  the  given  differential  equation. 


1.  t(t  —2)2y"  +  ty'+  y  =  0;  /  =  0 
3.  (sin  t)y"  + (cos  t)y'+  —  y  =  0;  /  =  0  4.  (e{ —  l)y"+ e*y'+ y  =  0;  t  —  0 


2.  —  2)2y"+  /y'-b  y  =  0;  t  =  2 


5.  (l-f2)/'+- 


1 


sin(/  4- 1) 

6.  /3j"  +  (sin  t2)y'+  ty  =  0;  r  =  0 


y  =  0;  t  —  —  1 


Find  the  general  solution  of  each  of  the  following  equations. 

7.  2t2y"-\~3tyf-{\  +  t)y  =  0  8.  2(y"  +  (l -20/- 7  =  0 

9.  2(y"  +  (l  +  0/-2.y  =  0  10.  2t2y”-ty'  +  (l  +  t)y  =  Q 

11.  4(y"  +  3y'  — 3y  =  0  12.  lt2y"  +  (t2  -  t)y'+  y  =  0 


In  each  of  Problems  13-18,  find  two  independent  solutions  of  the  given 
equation.  In  each  problem,  the  roots  of  the  indicial  equation  differ  by  a 
positive  integer,  but  two  solutions  exist  of  the  form  tr'2f=0antn. 

13.  t2y"—  ty'  —  (t2  +  %)y  =  0  14.  t2y"+  (t  —  t2)y' —  y  —  0 

15.  ty"  —  (t2  +2)y'+  ty  =  0  16.  t2y"  +  (3t  —  t2)y' —  ty  =  0 

17.  t2y"+t(t  +  \)y'-y  =  0  18.  r/'-(4  +  t)y’  +  2y  =  0 

19.  Consider  the  equation 

t2y"  +  (t2  —  3t)y'  +  3y  =  0  (*) 

(a)  Show  that  r  =  1  and  r  =  3  are  the  two  roots  of  the  indicial  equation  of  (*). 

(b)  Find  a  power  series  solution  of  (  * )  of  the  form 

00 

M0  =  ?3  2  a,'”.  a0  =  \. 

n  =  0 
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(c)  Show  that yx(t)  =  t3e  '. 

(d)  Show  that  ( * )  has  no  solution  of  the  form 

00 

t  2  bnt\ 

n  =  0 

(e)  Find  a  second  solution  of  ( * )  using  the  method  of  reduction  of  order.  Leave 
your  answer  in  integral  form. 

20.  Consider  the  equation 


t2y"+  ty'  —  (\  +  t)y  =  0. 


(a)  Show  that  r  =  —  1  and  r  —  1  are  the  two  roots  of  the  indicial  equation. 

(b)  Find  one  solution  of  the  form 

00 

yx(t)=t  2  antn. 

n  =  0 

(c)  Find  a  second  solution  using  the  method  of  reduction  of  order. 

21.  Consider  the  equation 

ty"  +  ty'  +  2y  =  0. 

(a)  Show  that  r  —  0  and  r  —  1  are  the  two  roots  of  the  indicial  equation. 

(b)  Find  one  solution  of  the  form 

00 

y\0)  =  *  2 

n  =  0 

(c)  Find  a  second  solution  using  the  method  of  reduction  of  order. 

22.  Consider  the  equation 


(y"  +  (  1  —  t2)y'  +  4ty  =  0. 


(a)  Show  that  r  =  0  is  a  double  root  of  the  indicial  equation. 

(b)  Find  one  solution  of  the  form yx{t)  —  ajn- 

(c)  Find  a  second  solution  using  the  method  of  reduction  of  order. 

23.  Consider  the  Bessel  equation  of  order  zero 


t2y"  +  ty'+  t2y  =  0. 


(a)  Show  that  r  —  0  is  a  double  root  of  the  indicial  equation. 

(b)  Find  one  solution  of  the  form 


t' 


22-42-62 


+ 


This  solution  is  known  as  /0(0- 

(c)  Find  a  second  solution  using  the  method  of  reduction  of  order. 
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24.  Consider  the  Bessel  equation  of  order  v 

t2y"+  ty'  +  (t2  —  v2)y  =  0 

where  v  is  real  and  positive. 

(a)  Find  a  power  series  solution 

v  00 
L  Vm  n  =  0 

This  function  Jv(t)  is  called  the  Bessel  function  of  order  v. 

(b)  Find  a  second  solution  if  2v  is  not  an  integer. 

25.  The  differential  equation 

ty”  +  {\  —  t)y'+  Xy  =  0,  A  constant, 

is  called  the  Laguerre  differential  equation. 

(a)  Show  that  the  indicial  equation  is  r2  —  0. 

(b)  Find  a  solution  y(t)  of  the  Laguerre  equation  of  the  form  2^=0  antn. 

(c)  Show  that  this  solution  reduces  to  a  polynomial  if  A  —  n. 

26.  The  differential  equation 

t{\  —  t)y"  +  [y  —  {\  +  a  +  P)t]y'~  afiy  —  0 

where  a,  /?,  and  y  are  constants,  is  known  as  the  hypergeometric  equation. 

(a)  Show  that  t  —  0  is  a  regular  singular  point  and  that  the  roots  of  the  indicial 
equation  are  0  and  1-y. 

(b)  Show  that  t  —  1  is  also  a  regular  singular  point,  and  that  the  roots  of  the 
indicial  equation  are  now  0  and  y  —  a  —  ft. 

(c)  Assume  that  y  is  not  an  integer.  Find  two  solutions  yx(t)  and  y2(t)  of  the 
hypergeometric  equation  of  the  form 

00  00 

*(0=  2  anr,  y2(t)  =  t'-'  2  b„t\ 

rt  —  0  n  —  0 

27.  (a)  Show  that  the  equation 

2(sin  /)y"  +  (l  —  t)y'  —  2y  —  0 

has  two  solutions  of  the  form 

00  00 

7i(0=  2  a„tn,  y2(t)  =  t>/2  2  bntn. 

n= 0  n = 0 

(b)  Find  the  first  5  terms  in  these  series  expansions  assuming  that  a0  —  b0  —  1. 

28.  Let  y(t)  =  u(t)+  iv(t)  be  a  complex-valued  solution  of  (3)  with  p(t)  and  q(t) 
real.  Show  that  both  u(t)  and  v(t)  are  real- valued  solutions  of  (3). 

29.  (a)  Show  that  the  indicial  equation  of 

t2y"+  tyf  +  (\  +  t)  y  =  0  (*) 

has  complex  roots  r=±i. 
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(b)  Show  that  (*)  has  2  linearly  independent  solutions y(t)  of  the  form 


00 


00 


y(t)  =  sin(ln  t)  2  antn  +cos(\nt)  2  bntn . 

n  —  0  n  =  0 

2.8.3  Equal  roots ,  and  roots  differing  by  an  integer 
Equal  roots. 

We  run  into  trouble  if  the  indicial  equation  has  equal  roots  r,  =  r2  because 
then  the  differential  equation 

p«^+e«f +*(<)>■ =o  (0 

has  only  one  solution  of  the  form 


00 


y(*)  =  tr  2  antn. 

n  =  0 


(2) 


The  method  of  finding  a  second  solution  is  very  similar  to  the  method  used 
in  finding  a  second  solution  of  Euler’s  equation,  in  the  case  of  equal  roots. 
Let  us  rewrite  (2)  in  the  form 


00 


y(t)  =  y(t ,  r)  =  tr  2  a„(r)tn 

n  =  0 

to  emphasize  that  the  solution  y(t)  depends  on  our  choice  of  r.  Then  (see 
Section  2.8.2) 

L[y]{t,r)  =  a0F{r)tr 

n  —  1 


oo 


+  2 

n  =  1 


an(r)F{n  +  r)+  2  [(£  +  r)pn-k  +  q„-k] 


a 


k  =  0 


Yt 


n  +  r 


We  now  think  of  r  as  a  continuous  variable  and  determine  a„  as  a  function 


of  r  by  requiring  that  the  coefficient  of  tn+r  be  zero  for  n  >  1.  Thus 


n 


a„(r) 


n  —  1 

2  [(k  +  r)p„_k  +  qn_^a, 

k  =  0 


F(n  +  r) 

With  this  choice  of  an(r ),  we  see  that 

L[y]{t,r)  =  aQF{r)tr. 


(3) 


In  the  case  of  equal  roots,  F(r)  =  (r  —  rx)2y  so  that  (3)  can  be  written  in  the 
form 

L[y]{t,r)  =  a0{r-rxftr. 

Since  L[y](t,  rx)  —  0,  we  obtain  one  solution 


^(0=^ 


00 


«0+  2  an(r l)f" 
n  =  1 
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Observe  now,  that 


9 y 

dr 


{t,r) 


= -^a0(r  -  rt)2 tr 
=  2a0(r  -  rl)tr  +  a0(r  -  ^ )2(ln/)f' 


also  vanishes  when  r  —  rv  Thus 

y2(*)  =  i;yi(t>r)\r=i 


dr 

d_ 

dr 

OO 


00 


2  an(r)t 

n  =  0 


n  +  r 


\r=rx 


oo 


=  2  [ an{rx)tn+r']\ru+  2  <(ri )t 

n— 0  n— 0 

oo 

=  yi(t)lnt+  2  <(ri)tn+r' 

n  =  0 


n  +  r. 


is  a  second  solution  of  (1). 

Example  1.  Find  two  solutions  of  Bessel’s  equation  of  order  zero 

-Aiii,!.- 


+  +  <'>  =  o,  <>o. 


Solution.  Set 


00 


y(t)=  2 

n  =  0 


Computing 


00 


/(0=  2  (»  +  »•)«,.* 

n  =  0 


n  +  r—  1 


and 


00 


/'(0=  2  («  +  /•)(/! +  r-l)a„f 

n  =  0 


n  +  r— 2 


we  see  that 


00 


00 


L[y]  =  2  {n  +  r)(n  +  r  —  \)antn+r+  2  (n  +  r)antn+r  + 

n= 0  n— 0  n 

00  00 

=  2  ( n  +  rfantn+r+  2  an_2tl 

n  =  0  n  =  2 


n  +  r 
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Setting  the  sums  of  like  powers  of  t  equal  to  zero  gives 

(i)  r2a0  =  F(r )a0  —  0 

(ii)  (1  +  r)2ax  =  ^(1  +  r)ax  =  0 
and 

(iii)  ( n  +  r)2a„  =  F(n  +  r)an  =  -  a„_2,  n>  2. 


Equation  (i)  is  the  indicial  equation,  and  it  has  equal  roots  rx  =  r2  =  0. 
Equation  (ii)  forces  ax  to  be  zero,  and  the  recurrence  relation  (iii)  says  that 

—  a . 


an  = 


n  —  2 


( n  +  r ) 


2  * 


Clearly,  a3  =  a5  =  a7  =  •  •  •  =0.  The  even  coefficients  are  given  by 


a2(r)  = 


a4(r)  = 


a 


o 


1 


(2  +rf  (2+  r  )' 


—  a 


(4+r)  (24-  r)2(4+  r)' 


and  so  on.  Proceeding  inductively,  we  see  that 


a2n(r)  = 


(-1 )" 


(2+  /•)2(4+  r)2  •  ■  ■  (2  n  +  r ) 
To  determine  yx(t),  we  set  r  —  0.  Then 

-1 


2  ’ 


a2(0)  = 
a4(0)  = 


a«(°)  = 


1  1  1 


22-42  24  (2\f 

-1  -1 


and  in  general 


«2  »(0) 


22-42-62  26(3!)2 


(-1)"  _  (-1)" 


22-42-  •  •  (2«)2  22"(n!) 


2  * 


Hence, 


t2  .  t 4  t6 


■vi(0-i“  +  „d  ^.x2 


22  24-(2!)z  26(3!)2 


+ 


=  |  (-l)V" 

n  =  o  2 2n(«!)2 

is  one  solution  of  (4).  This  solution  is  often  referred  to  as  the  Bessel  function 
of  the  first  kind  of  order  zero,  and  is  denoted  by  J0(t). 
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To  obtain  a  second  solution  of  (4)  we  set 


00 


>'2(0  =  ^i(0ln^+  2  a'2n(0)t2n. 

n  =  0 


To  compute  a2n( 0),  observe  that 

=  j:ln|«2.W|=|:ln(2+r) 


-2 


(2«  4  r ) 


2  —  [ln(2  4-  r )  4-  ln(4 -f  r )  4-  •  •  •  4-  ln(2«  4-  r )] 

2i  '  •  '  •  ••  •  ' 


24  r  44-  r 


In  4  r  /  ’ 


Hence, 


Setting 


a2«(0)-~2(^  +  ^+  +^)°  2n(°) 

=  -(l  +  ^+  •••  +  ~  ja2«(0)’ 


//„  —  l  +  ^r  +  —  +  •••  H - 

2  3  n 


n 


we  see  that 


and  thus 


-Hn(-\y  _(-\y+'Hn 

2  2n(n\f  22"(/i!)2 


00  /  1\W  +  1IJ 

>’2(0  =  3;i(/)ln'+  2  ^  ,  .  ,2  "?2" 

»=o  22"(n  !)2 


is  a  second  solution  of  (4)  with  Hn  given  by  (5). 

Roots  differing  by  a  positive  integer.  Suppose  that  r2  and  rx  —  r2  4-  N,  N  a 
positive  integer,  are  the  roots  of  the  indicial  equation.  Then  we  can  certainly 
find  one  solution  of  (1)  of  the  form 

00 

yM  =  tr'  £  a„(r,)/n. 

n  =  0 

As  we  mentioned  previously,  it  may  not  be  possible  to  find  a  second 
solution  of  the  form 

00 

2  bntn- 

n  =  0 
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In  this  case,  Equation  (1)  will  have  a  second  solution  of  the  form 

^(0  =  j; y (t’r) 


r  =  r2 
00 


=  ayx(t)\nt+  £  a'n(r2)tn 

n  =  0 

where  a  is  a  constant,  and 

00 

y(t,r)  =  tr  2  an(r)tn 


+  ri 


n  =  0 


with 


ao  =  ao(r)  =  r~r2 • 


The  proof  of  this  result  can  be  found  in  more  advanced  books  on  differen¬ 
tial  equations.  In  Exercise  5,  we  develop  a  simple  proof,  using  the  method  of 
reduction  of  order,  to  show  why  a  logarithm  term  will  be  present. 


Remark.  It  is  usually  very  difficult,  and  quite  cumbersome,  to  obtain  the 
second  solution  y2(t)  when  a  logarithm  term  is  present.  Beginning  and 
intermediate  students  are  not  expected,  usually,  to  perform  such  calcula¬ 
tions.  We  have  included  several  exercises  for  the  more  industrious  students. 
In  these  problems,  and  in  similar  problems  which  occur  in  applications,  it  is 
often  more  than  sufficient  to  find  just  the  first  few  terms  in  the  series 
expansion  of  y2(t ),  and  this  can  usually  be  accomplished  using  the  method 
of  reduction  of  order. 


Exercises 


In  Problems  1  and  2,  show  that  the  roots  of  the  indicial  equation  are  equal, 
and  find  two  independent  solutions  of  the  given  equation. 


1.  ty"  +  y'-4y  =  0 

2.  t2y"—  t(\  +  t)y'  +  y  —  0 


3.  (a)  Show  that  r  —  —  1  and  r  —  1  are  the  roots  of  the  indicial  equation  for  Bessel’s 
equation  of  order  one 

.2  d2y  ,  dy 


„2+^+('2-»,  =  0. 


(b)  Find  a  solution: 


00 


•M0  =  2'  2  a„tn,a0  =  \ 

n  =  0 


Jx(t)  is  called  the  Bessel  function  of  order  one. 
(c)  Find  a  second  solution: 

r 

00 


J_  Y  (-1  ;2„ 

22"h  !(«  —  !)! 
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4.  Consider  the  equation 

ty"  +  3y'  ~3y  =  0,  t>  0. 

(a)  Show  that  r  —  0  and  r  —  —  2  are  the  roots  of  the  indicial  equation. 

(b)  Find  a  solution 

00 

>>,(0=  2 

n  =  0 

(c)  Find  a  second  solution 

/  \ _  /  \i  ,1  1  .  1  ,  11  ,  31  2  , 

y2(t) -y}(t)lnt +  -£--  + 4+ +  +•••. 

5.  This  exercise  gives  an  alternate  proof  of  some  of  the  results  of  this  section,  using 
the  method  of  reduction  of  order. 

(a)  Let  t  =  0  be  a  regular  singular  point  of  the  equation 

t2y"+tp(t)y'+  q(t)y  =  0  (i) 

Show  that  the  substitution  y  =  trz  reduces  (i)  to  the  equation 

t2z"  +  \2r  +  p(t)]tz'  +  [r(r  —  1)  +  rp(t)  +  q(t)]z  =  0.  (ii) 

(b)  Let  r  be  a  root  of  the  indicial  equation.  Show  that  (ii)  has  an  analytic 
solution  zx{t)  —  l^=Qantn. 

(c)  Set  z2(t)  —  zx(t)v(t).  Show  that 

/•  e~f[2r+p(t)]/tdt 

v(t)  =  I  u(t)  dt,  wherew(/)=: - - - . 

J  zfO) 

(d)  Suppose  that  r  —  r0  is  a  double  root  of  the  indicial  equation.  Show  that 

+  Po  =  U  and  conclude  therefore  that 

w(f)  —  — -  +  +  U2t  +  •**  . 

/ 

(e)  Use  the  result  in  (d)  to  show  that  y2(t )  has  an  In  t  term  in  the  case  of  equal 
roots. 

(f )  Suppose  the  roots  of  the  indicial  equation  are  r0  and  rQ  —  N,  N  a  positive 
integer.  Show  that  2  r0  +  p0  =  1  +  N,  and  conclude  therefore,  that 

where  u(t)  is  analytic  at  t  —  0. 

(g)  Use  the  result  in  (f)  to  show  that  y2(t)  has  an  In  t  term  if  the  coefficient  of  tN 
in  the  expansion  of  u(t)  is  nonzero.  Show,  in  addition,  that  if  this  coefficient 
is  zero,  then 

v ( =  — 4-  •  •  •  H - -  -\-  v xt  v2t2  •  •  • 

v  f  tN  t  ii 

and  y2(t)  has  no  In  /  term. 
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2.9  The  method  of  Laplace  transforms 

In  this  section  we  describe  a  very  different  and  extremely  clever  way  of 
solving  the  initial-value  problem 

tf^  +  ^  +  cy=/(0;  ,«>)-,»  y'(0)=y'Q  (1) 

where  a ,  b  and  c  are  constants.  This  method,  which  is  known  as  the 
method  of  Laplace  transforms,  is  especially  useful  in  two  cases  which  arise 
quite  often  in  applications.  The  first  case  is  when  f(t)  is  a  discontinuous 
function  of  time.  The  second  case  is  when  /(/)  is  zero  except  for  a  very 
short  time  interval  in  which  it  is  very  large. 

To  put  the  method  of  Laplace  transforms  into  proper  perspective,  we 
consider  the  following  hypothetical  situation.  Suppose  that  we  want  to 
multiply  the  numbers  3.163  and  16.38  together,  but  that  we  have  forgotten 
completely  how  to  multiply.  We  only  remember  how  to  add.  Being  good 
mathematicians,  we  ask  ourselves  the  following  question. 

Question :  Is  it  possible  to  reduce  the  problem  of  multiplying  the  two  num¬ 
bers  3.163  and  16.38  together  to  the  simpler  problem  of  adding  two  num¬ 
bers  together? 

The  answer  to  this  question,  of  course,  is  yes,  and  is  obtained  as  follows. 
First,  we  consult  our  logarithm  tables  and  find  that  ln3.163  =  1.15152094, 
and  In  16.38  =  2.79606108.  Then,  we  add  these  two  numbers  together  to 
yield  3.94758202.  Finally,  we  consult  our  anti-logarithm  tables  and  find 
that  3.94758202  =  In 5 1.80994.  Hence,  we  conclude  that  3.163x  16.38  = 
51.80994. 

The  key  point  in  this  analysis  is  that  the  operation  of  multiplication  is 
replaced  by  the  simpler  operation  of  addition  when  we  work  with  the  loga¬ 
rithms  of  numbers,  rather  than  with  the  numbers  themselves.  We  represent 
this  schematically  in  Table  1.  In  the  method  to  be  discussed  below,  the  un¬ 
known  function  y(t)  will  be  replaced  by  a  new  function  Y (s),  known  as  the 
Laplace  transform  of  y(t).  This  association  will  have  the  property  that  y'(0 
will  be  replaced  by  sY  Cs)-y(0).  Thus,  the  operation  of  differentiation  with 
respect  to  /  will  be  replaced,  essentially,  by  the  operation  of  multiplication 
with  respect  to  s.  In  this  manner,  we  will  replace  the  initial-value  problem 
(1)  by  an  algebraic  equation  which  can  be  solved  explicitly  for  Y(s).  Once 
we  know  Y  ( s ),  we  can  consult  our  “anti-Laplace  transform”  tables  and  re¬ 
cover  y(t). 

Table  1 

a  — ^  In  a 

b  — >  In  b 

a-b  — ^  \na  +  \nb 
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We  begin  with  the  definition  of  the  Laplace  transform. 

Definition.  Let  f(t)  be  defined  for  0<  t<oo.  The  Laplace  transform  of 
f(t),  which  is  denoted  by  F(s),  or  £{/(/)),  is  given  by  the  formula 

F(s)  =  £{/(/)}=  f  e~s'f(t)dt  (2) 

where 


lim  f  e  stf(t)dt. 
— >oo  J  o 


Example  1.  Compute  the  Laplace  transform  of  the  function  /(/)=  1. 
Solution.  From  (2), 

£{/(0)=  f  e  st dt=  lim  - — - - 

1  J  A—>oo  Jq  A-> oo  .v 


00, 


^>0 
s  <0 


Example  2.  Compute  the  Laplace  transform  of  the  function  eat . 
Solution.  From  (2), 

^  „  rA  p(a-s)A_  1 

t{eat}=  lim  f  e  steatdt=  lim  - 

A  — >oo  J 0  A— >oo  Cl  S 


<  s  —  a 
oo. 


s>  a 
s<a 


Example  3.  Compute  the  Laplace  transform  of  the  functions  cosc ot  and 
sin  oit. 

Solution.  From  (2), 


r  00  _  r  00 

£{coswf}  =  I  e  5'cos o)tdt  and  £{sinwf}  =  I  e~stsincotdt. 

•'O  *A) 


Now,  observe  that 


£{cosc*tf}  +  /£{sincof}  =  f 

•'o 


00 


=  lim 

A— >00 


e~s,eiu,dt=  lim 

A— >00 

e(iu-s)A  __  j 


/' 

*/0 


,(/«- j)f 


10)  — s 


1  _  S  +  /  CO 

<  5  —  /CO  ty2  +  (02  s 

.undefined, 


^>0 

£<0 
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Equating  real  and  imaginary  parts  in  this  equation  gives 

£(cos<of|  =  ■■■  -  -  and  £{sincof|  =  — < — s> 0. 

1  J  s2  +  w2  1  $2  +  <o2 

Equation  (2)  associates  with  every  function  f(t)  a  new  function,  which 
we  call  F(s).  As  the  notation  £{/(f)}  suggests,  the  Laplace  transform  is  an 
operator  acting  on  functions.  It  is  also  a  linear  operator,  since 

£{c\f\(t)  +  C2h(t)}  =  e~s,[cjl{t)  +  c2f2{t)]dt 

*/0 

=  c,  f  e~s'f,  (t)dt+c2  f  e  ~s‘f2  (/) dt 
•'O  •'O 

=  Cl^{f\  (0}  +  c2^{fl  (0)* 

It  is  to  be  noted,  though,  that  whereas  f(t)  is  defined  for  0<  t<oo,  its 
Laplace  transform  is  usually  defined  in  a  different  interval.  For  example, 
the  Laplace  transform  of  e2t  is  only  defined  for  2«0<  oo,  and  the  Laplace 
transform  of  eSt  is  only  defined  for  8<s<  oo.  This  is  because  the  integral 
(2)  will  only  exist,  in  general,  if  s  is  sufficiently  large. 

One  very  serious  difficulty  with  the  definition  (2)  is  that  this  integral 
may  fail  to  exist  for  every  value  of  s.  This  is  the  case,  for  example,  if  /(/)  = 
ef  (see  Exercise  13).  To  guarantee  that  the  Laplace  transform  of /(/)  exists 
at  least  in  some  interval  »y>^0,  we  impose  the  following  conditions  on  f(t). 

(i)  The  function  f(t)  is  piecewise  continuous.  This  means  that  /(/)  has  at 
most  a  finite  number  of  discontinuities  on  any  interval  0 <t<A,  and 
both  the  limit  from  the  right  and  the  limit  from  the  left  of  /  exist  at  ev¬ 
ery  point  of  discontinuity.  In  other  words, /(f)  has  only  a  finite  number 
of  “jump  discontinuities”  in  any  finite  interval.  The  graph  of  a  typical 
piecewise  continuous  function  f(t)  is  described  in  Figure  1. 


Figure  1.  Graph  of  a  typical  piecewise  continuous  function 
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(ii)  The  function  f(t)  is  of  exponential  order,  that  is,  there  exist  constants 
M  and  c  such  that 


|/(/)|<  Mec\  0<*<oo. 


Lemma  1.  If  f(t)  is  piecewise  continuous  and  of  exponential  order ,  then  its 
Laplace  transform  exists  for  all  s  sufficiently  large.  Specifically ,  if  f(t)  is 
piecewise  continuous ,  and  |/(/)|  <  Mect,  then  F(s)  exists  for  s>c. 

We  prove  Lemma  1  with  the  aid  of  the  following  lemma  from  integral 
calculus,  which  we  quote  without  proof. 


Lemma  2.  Let  g(t)  be  piecewise  continuous.  Then ,  the  improper  integral 

J°°g(t)dt  exists  if  jc°\g{t)\dt  exists.  To  prove  that  this  latter  integral  ex¬ 
ists ,  it  suffices  to  show  that  there  exists  a  constant  K  such  that 


for  all  A . 


f A\g(t)\dt<K 


Remark.  Notice  the  similarity  of  Lemma  2  with  the  theorem  of  infinite 
series  (see  Appendix  B)  which  states  that  the  infinite  series  2  an  converges 

if  2  \an\  converges,  and  that  2  \an\  converges  if  there  exists  a  constant  K 
such  that  \ax\  +  ...  +  \an\  <  K  for  all  n. 


We  are  now  in  a  position  to  prove  Lemma  1. 

Proof  of  lemma  1.  Since  f(t)  is  piecewise  continuous,  the  integral 

f*e~stf(t)dt  exists  for  all  A.  To  prove  that  this  integral  has  a  limit  for  all  s 
sufficiently  large,  observe  that 


~stectdt 


M 
c  —  s 


e(c~s)A  _ 


M 
s  —  c 


for  s>c.  Consequently,  by  Lemma  2,  the  Laplace  transform  of  /(/)  exists 
for  s>c.  Thus,  from  here  on,  we  tacitly  assume  that  \f(t)\<Mect ,  and 
s>c.  □ 


The  real  usefulness  of  the  Laplace  transform  in  solving  differential 
equations  lies  in  the  fact  that  the  Laplace  transform  of  /'(/)  is  very  closely 
related  to  the  Laplace  transform  of/(/).  This  is  the  content  of  the  follow¬ 
ing  important  lemma. 
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Lemma  3.  Let  F(s)  =  t{f{t)}.  Then 

£{/'(/)}  =  s£{f(t)}-m  =  sF(s)-f(0). 

Proof.  The  proof  of  Lemma  3  is  very  elementary;  we  just  write  down  the 
formula  for  the  Laplace  transform  of  /'(/)  and  integrate  by  parts.  To  wit, 


£{/'(/)}=  lim  P 

A— >oo  Jq 


—  St  ft 


f'(t)dt 


=  lim  e  s‘f{t)  +  lim  s  /  e 

A— >oo  0  A—>  oo 


fAe  s,f{t)dt 


=  -f(0)  +  s  lim  f  e  s'f(t),dt 

A  — >  oo  J  o 


=  -/(0)  +  j.F(.y). 


□ 


Our  next  step  is  to  relate  the  Laplace  transform  of  f"(t)  to  the  Laplace 
transform  of  /(/).  This  is  the  content  of  Lemma  4. 


Lemma  4.  Let  F(s)  =  £{/(0)*  Then, 

£{f"(t)}=s2F(s)-sm-m. 

Proof.  Using  Lemma  3  twice,  we  see  that 

£{r(t)}=st{nt)}-m 

=  s[jF(s)  -/(0)]-/'(0) 

=  s2F  (s)-sf  (0)  -/'(0).  □ 


We  have  now  developed  all  the  machinery  necessary  to  reduce  the  prob¬ 
lem  of  solving  the  initial-value  problem 

+  + o' =/(0;  /(0)=/0>  y'{Q)=y'o  (3) 

to  that  of  solving  an  algebraic  equation.  Let  7(s)  and  F{s)  be  the  Laplace 
transforms  of  y(t)  and  f(t)  respectively.  Taking  Laplace  transforms  of  both 
sides  of  the  differential  equation  gives 

Z{ay"{t)  +  by’{t)  +  cy{t)}  =  F{s). 

By  the  linearity  of  the  Laplace  transform  operator, 

Z{ay"(t)  +  by'(t)  +  cy(t)}=aZ{y"(t)}  +  bZ{y'{t)}+cZ{y(t)}, 
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and  from  Lemmas  3  and  4 

£  { y'( 0 } = s- Y  (s) -y<»  £  { y "( 0 } = s2y  (s)  -  wo  -y'o- 

Hence, 


a[s2Y  (s)  -  ^o-^o]  +  b[sY  (j)  -y0]  +  cY  (s)  =  F(s) 
and  this  algebraic  equation  implies  that 


r,s).  {a?+b)y°  * 


ay0 


+ 


F(s) 


as 2  +  bs  +  c  as2  +  bs  +  c  as2  +  bs  +  c 


Equation  (4)  tells  us  the  Laplace  transform  of  the  solution y(t)  of  (3).  To 
find  >>(/),  we  must  consult  our  anti,  or  inverse,  Laplace  transform  tables. 
Now,  just  as  7(s)  is  expressed  explicitly  in  terms  of  y(t );  that  is,  7(.s) 

=  /  °°e~sty(t)dt,  we  can  write  down  an  explicit  formula  for y{t).  However, 
Jo  . 

this  formula,  which  is  written  symbolically  as  y(t)=£  1{y(5)},  involves 

an  integration  with  respect  to  a  complex  variable,  and  this  is  beyond  the 

scope  of  this  book.  Therefore,  instead  of  using  this  formula,  we  will  derive 

several  elegant  properties  of  the  Laplace  transform  operator  in  the  next 

section.  These  properties  will  enable  us  to  invert  many  Laplace  transforms 

by  inspection;  that  is,  by  recognizing  “which  functions  they  are  the 

Laplace  transform  of”. 


Example  4.  Solve  the  initial-value  problem 


-3—  +2 y  =  e3t- 
dt  y  ’ 


y( o)  =  i,  /(o)-o. 


Solution .  Let  y(>s)=  t{y(t)}.  Taking  Laplace  transforms  of  both  sides  of 
the  differential  equation  gives 

* 2y  (*)  -  *  -  3  [  *  r  (*)  - 1  ] + 2  r  ( *)  =  ^ 
and  this  implies  that 


_ 1 _ 

(,s  — 3)  (s2  — 3  s +  2) 


+ 


s  —  3 

s2  —  3s  +  2 


(J-l)(.-2)(5-3)  +  (5-l)(5-2)- 


To  find  y(t),  we  expand  each  term  on  the  right-hand  side  of  (5)  in  partial 
fractions.  Thus,  we  write 


1 _  A 

l)(5-2)(5-3)  s-  1 


C 

s-3  ' 
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This  implies  that 

i4(j-2)(j-3)  +  £(j-l)(j-3)  +  C(j-l)(j-2)-l.  (6) 

Setting  s  =  1  in  (6)  gives  A  =  \\  setting  s  =  2  gives  B  —  —  1 ;  and  setting  s  =  3 
gives  C=  Hence, 

_ 1 _ =  J__l _ 111 

(s—l)(s  —  2)(s  —  3)  2  s—  1  s  —  2  2  s  —  3 

Similarly,  we  write 

s-3  _  D  E 

(s-l)(s-2)  s- 1  s  —  2 

and  this  implies  that 

Z)  (s  —  2)  +  is  (s  —  1)  =  s  —  3.  (7) 

Setting  5  =  1  in  (7)  gives  D  =  2,  while  setting  s  =  2  gives  E-  -  1.  Hence, 

Y(  w  i_j _ 1,11,2 _ \_ 

2  s— 1  s  —  2  2  s  —  3  s—  1  s  —  2 

=  5_J _ 2  11 

2s— 1  s—2  2s— 3* 

Now,  we  recognize  the  first  term  as  being  the  Laplace  transform  of 
Similarly,  we  recognize  the  second  and  third  terms  as  being  the  Laplace 
transforms  of  -2e2t  and  \e3t,  respectively.  Therefore, 

Y  (s)  =  £  { f  e '  —  2e2t  +  \e3‘ } 

so  that 

y{t)  =  \e*  —  2e2/  +  \e3t. 


Remark.  We  have  cheated  a  little  bit  in  this  problem  because  there  are  ac¬ 
tually  infinitely  many  functions  whose  Laplace  transform  is  a  given  func¬ 
tion.  For  example,  the  Laplace  transform  of  the  function 


>'-2e2'  +  ^>3', 

lo, 


t^=  1,  2,  and  3 
/=  1,2,3 


is  also  F(s),  since  z(t)  differs  from  >>(/)  at  only  three  points.*  However, 
there  is  only  one  continuous  function  y(t)  whose  Laplace  transform  is  a 
given  function  Y (V),  and  it  is  in  this  sense  that  we  write y{t)  =  £~ 1  {  Y (j)}. 


We  wish  to  emphasize  that  Example  4  is  just  by  way  of  illustrating  the 
method  of  Laplace  transforms  for  solving  initial-value  problems.  The  best 
way  of  solving  this  particular  initial-value  problem  is  by  the  method  of 


fb  Cb 

*If  f(t)  —  g(t)  except  at  a  finite  number  of  points,  then  /  f(t)dt—  I  g(t)dt 

J  a  Ja 
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judicious  guessing.  However,  even  though  it  is  longer  to  solve  this  particu¬ 
lar  initial-value  problem  by  the  method  of  Laplace  transforms,  there  is  still 
something  “nice  and  satisfying”  about  this  method.  If  we  had  done  this 
problem  by  the  method  of  judicious  guessing,  we  would  have  first  com¬ 
puted  a  particular  solution  \[/(t)  =  ^e3*.  Then,  we  would  have  found  two  in¬ 
dependent  solutions  e '  and  e2t  of  the  homogeneous  equation,  and  we 
would  have  written 

y(t)  =  cxet  - f  c2e2t  +  \e3t 

as  the  general  solution  of  the  differential  equation.  Finally,  we  would  have 
computed  cx  =  §  and  c2  =  —  2  from  the  initial  conditions.  What  is  unsatisfy¬ 
ing  about  this  method  is  that  we  first  had  to  find  all  the  solutions  of  the 
differential  equation  before  we  could  find  the  specific  solution  y(t)  which 
we  were  interested  in.  The  method  of  Laplace  transforms,  on  the  other 
hand,  enables  us  to  findy(/)  directly,  without  first  finding  all  solutions  of 
the  differential  equation. 


Exercises 

Determine  the  Laplace  transform  of  each  of  the  following  functions. 


1.  t 


2.  tn 


3.  eQi  cos  bt 
5.  cos2  at 
7.  sin  at  cos  bt 


4.  eatsmbt 
6.  sin2  a/ 

8.  /2sinr 


9.  Given  that  x2dx  =  Vn  /2,  find  £{t  1/2}.  Hint :  Make  the  change  of  vari¬ 
able  u  =  Vt  in  (2). 

Show  that  each  of  the  following  functions  are  of  exponential  order. 


10.  tn 


11.  sin  at 


12.  e 


Vt 


2  2 

13.  Show  that  ef  does  not  possess  a  Laplace  transform.  Hint :  Show  that  e r  ~st  > 

for  t>s+  1. 

14.  Suppose  that /(*)  is  of  exponential  order.  Show  that  F(s)=*  £{f(t)}  approaches 
0  as  s-*co. 

Solve  each  of  the  following  initial-value  problems. 

15.  y”  -  5/  +  Ay  =  e2‘;  ^(0)  =  1,  /(0)  =  - 1 

16.  2 y"+/-y  =  e3';  >-(0)=2,/(0)  =  0 

Find  the  Laplace  transform  of  the  solution  of  each  of  the  following  initial- 
value  problems. 

17.  /'  +  2/ +>>  =  <?-';  /(0)=l,/(0)  =  3 
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18.  y"+y  =  /2sin/;  j>(0)=>>'(0)=0 

19.  y "  +  3 y'  +  ly  =  cos /;  y(0)  =  0,  y'(0)  =  2 

20.  y"+/+.y  =  /3;  .y(0)  =  2,/(0)  =  0 

21.  Prove  that  all  solutions  y(/)  of  ay"  +  by'  +  cy  =f(t)  are  of  exponential  order  if 
/(/)  is  of  exponential  order.  Hint :  Show  that  all  solutions  of  the  homogeneous 
equation  are  of  exponential  order.  Obtain  a  particular  solution  using  the 
method  of  variation  of  parameters,  and  show  that  it,  too,  is  of  exponential 
order. 


22.  Let  F(s)  =  £{/(0}-  Prove  that 


=  snF(s)-sn-lf(  0)- 


Hint:  Try  induction. 


dF'lK  0) 

dtn~x 


23.  Solve  the  initial-value  problem 


/"  -  6y "  +  1  \y ' -  6y  =  e4';  y  (0)  =y'(0)  =y"(  0)  -  0 


24.  Solve  the  initial-value  problem 

y"  —2>y'  +  2y  =  e~‘;  >>(/o)=l,  ?(t 0)=0 

by  the  method  of  Laplace  transforms.  Hint :  Let  <j>(t)=y(t  + 10). 


2.10  Some  useful  properties  of  Laplace  transforms 

In  this  section  we  derive  several  important  properties  of  Laplace  trans¬ 
forms.  Using  these  properties,  we  will  be  able  to  compute  the  Laplace 
transform  of  most  functions  without  performing  tedious  integrations,  and 
to  invert  many  Laplace  transforms  by  inspection. 

Property  1.  If  &{f(t)}  =  F{s\  then 

£{-tf(t)}  =  j-sF{s). 

Proof.  By  definition,  F(s)=  f ™e~stf(i)dt .  Differentiating  both  sides  of 

•/  0 

this  equation  with  respect  to  s  gives 

iw-ir'-M* 

=  J  -^(e~“)f(t)dt  =  —  te~s,f{t)dt 

=  £{-'/(')}•  □ 
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Property  1  states  that  the  Laplace  transform  of  the  function  — tf(t )  is 
the  derivative  of  the  Laplace  transform  of  f(t).  Thus,  if  we  know  the 
Laplace  transform  F(s)  of  /(/),  then,  we  don’t  have  to  perform  a  tedious 
integration  to  find  the  Laplace  transform  of  tf(t);  we  need  only  differenti¬ 
ate  F(s)  and  multiply  by  —  1. 


Example  1.  Compute  the  Laplace  transform  of  te*. 

Solution .  The  Laplace  transform  of  el  is  1/(5- 1).  Hence,  by  Property  1, 
the  Laplace  transform  of  te*  is 


£{/*>'}  = 


d  1  =  1 

(j-1)2’ 


Example  2.  Compute  the  Laplace  transform  of  t 13. 
Solution.  Using  Property  1  thirteen  times  gives 


/_  n13  d13  a  ,  ,xl3  d13  1  (13)! 


ds 


ds 13  * 


14 


The  main  usefulness  of  Property  1  is  in  inverting  Laplace  transforms,  as 
the  following  examples  illustrate. 

Example  3.  What  function  has  Laplace  transform  —  1  /(s  —  2 )2? 

Solution.  Observe  that 


1 


d  1 


(s-2? 

Hence,  by  Property  1, 


ds  s  —  2 


and 


1 


5  —  2 


=  £{e2t). 


(.s-2)' 


te2‘. 


Example  4.  What  function  has  Laplace  transform 
Solution.  Observe  that 


45/(52  +  4)2? 


45 


d  2 


(52  +  4) 

Hence,  by  Property  1, 


2  ds  5.2  +  4 


and 


52  +  4 


=  £{sin2/}. 


£~N  - 


45 


(s2+4)2 


=  —t$iri2t. 


Example  5.  What  function  has  Laplace  transform  1/(5  —  4)3? 
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Solution .  We  recognize  that 

1  =  d2  1  1 

(,s-4)3  ds 2  2  J-4 

Hence,  using  Property  1  twice,  we  see  that 


Property  2.  If  F(s)  =  £{/(0),  then 


Proof.  By  definition, 


£{^7(0} 


~steatf(t)dt  =  r°°e(fl-j)7(/)^ 

•'0 


Property  2  states  that  the  Laplace  transform  of  eatf(t)  evaluated  at  the 
point  s  equals  the  Laplace  transform  of  f(t)  evaluated  at  the  point  ( s  —  a). 
Thus,  if  we  know  the  Laplace  transform  F(s )  of  /(/),  then  we  don’t  have  to 
perform  an  integration  to  find  the  Laplace  transform  of  eatf(t );  we  need 
only  replace  every  s  in  F(s)  by  s  —  a. 


Example  6.  Compute  the  Laplace  transform  of  e3/sin  t. 

Solution.  The  Laplace  transform  of  sin/  is  l/(s2+l).  Therefore,  to  com¬ 
pute  the  Laplace  transform  of  e3'sin/,  we  need  only  replace  every  s  by 
s  —  3;  that  is, 

£{e3'sin?}=  T,,' 

(s-3)  +1 

The  real  usefulness  of  Property  2  is  in  inverting  Laplace  transforms,  as 
the  following  examples  illustrate. 


Example  7.  What  function  g(t)  has  Laplace  transform 


Solution.  Observe  that 


s-1 

25  +  (s-7) 


f<s,-77?‘e{cos5,) 

and  that  (/(V)  is  obtained  from  F(s)  by  replacing  every  s  by  s  —  1.  Hence, 
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by  Property  2, 


s-1 

{s  —  1)2  +  25 


t{ent  cos5/}. 


Example  8.  What  function  has  Laplace  transform  1  /(s2  — 4s +  9)1 
Solution.  One  way  of  solving  this  problem  is  to  expand  \/(s2  —  4s  +  9)  in 
partial  fractions.  A  much  better  way  is  to  complete  the  square  of  s2  —  4s  + 
9.  Thus,  we  write 

1  _ 1 _  1 

s2-4s  +  9  s2-4s  +  4  +  {9~4)  (s-2)2  +  5 

Now, 


s2  +  5 


=  £ 


V5 


sin 


V5  t 


Hence,  by  Property  2, 

1 


s2-4s  +  9  (j  —  2)2  +  5 


=  £ 


V5 


e2'sinVJ  t 


Example  9.  What  function  has  Laplace  transform  s/{s2  — 4j  +  9)? 

Solution.  Observe  that 

s  _  s- 2  +  2 

s2  —  4s  +  9  (s-2)2  +  5  (s-2)2  +  5 

The  function  ^/(^2  +  5)  is  the  Laplace  transform  of  cosV5  t .  Therefore,  by 
Property  2, 

(s-2f  +  5  1  ; 

and 


^2  — 4^  +  9 


=  e 


e2t  cos 


V5  /H — —  £2'sinV5  /]. 
V5 


In  the  previous  section  we  showed  that  the  Laplace  transform  is  a  linear 
operator;  that  is 


(0-Pc2/2(/)}  =  c,e{/1  (0}+^2e{/2(0}. 


Thus,  if  we  know  the  Laplace  transforms  and  F2(s)9  of  fx(t)  and  /2(0, 
then  we  don’t  have  to  perform  any  integrations  to  find  the  Laplace  trans¬ 
form  of  a  linear  combination  of  fx(t)  and  f2(t)\  we  need  only  take  the  same 
linear  combination  of  Fx(s)  and  F2(s).  For  example,  two  functions  which 
appear  quite  often  in  the  study  of  differential  equations  are  the  hyperbolic 
cosine  and  hyperbolic  sine  functions.  These  functions  are  defined  by  the 
equations 


coshat  = 


eat  +  e  at 


2 


sinha/  = 


—  e 


—  at 


2 
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Therefore,  by  the  linearity  of  the  Laplace  transform, 

£{coshaf}  =  t{ea‘ }  +  ^  £{e~°' } 


2 

2 


1  +  1 


s— a  s  +  a 


2  2 
s  —  a 


and 


£{sinhaf}  =  ~  £{eat }  -  \&{e  at } 


2 


1 


1 


s  —  a  s  +  a 


a 


2  2 
s  —  a 


Exercises 


Use  Properties  1  and  2  to  find  the  Laplace  transform  of  each  of  the  follow¬ 
ing  functions. 

1.  tn  2.  tneat  3.  tsinat  4.  t2 cosat 

5.  t5'2  (see  Exercise  9,  Section  2.9) 

6.  Let  F(s)=  £{/(/)}>  and  suppose  that  f(t)/t  has  a  limit  as  t  approaches  zero. 
Prove  that 

C) 


£{/«/'}=  f°°F(u)du. 

J  c 


(The  assumption  that  f(t)/t  has  a  limit  as  t->0  guarantees  that  the  integral  on 
the  right-hand  side  of  (*)  exists.) 

7.  Use  Equation  (*)  of  Problem  6  to  find  the  Laplace  transform  of  each  of  the 
following  functions: 


(a) 


sin /  ^  cosat—  1  ,  x  eat  —  ebt 


(b) 


(c) 


Find  the  inverse  Laplace  transform  of  each  of  the  following  functions.  In 
several  of  these  problems,  it  will  be  helpful  to  write  the  functions 

als3  +  fils2+yis  +  8i  als2  +  Pis  +  Y 

/w)=  /  ,  .  .  .  w  .  . — — T  and  Pi(s)  = 


(i as  4-  b){cs2  +  ds  +  e) 


(as2  +  bs  +  c)(ds 2  +  es+f) 
in  the  simpler  form 

As  +  B  ,  Cs  +  D  j  A  ,  Cs  +  D 

P i (*)=—,:,  ,  +— - —  and  p2(s)- - —  + 


as  +bs  +  c  dsz+es+f 


as  +  b  cs2  +  ds  +  e 


8. 


10. 


12. 


(s  +  a)  +  b2 
1 

5(.s2  +  4) 

_ 1 _ 

(s2  +  a2)(s2  +  b2) 


9. 


11. 


13. 


r2-5 


53  +  4s2  +  3.s 
s 

s2  —  3s—  12 
35 


(s+\y 
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14. 


16. 


1 


s(s  +  4)‘ 
1 

(*2+d: 


15. 


(j+DV+i) 


17.  Let  F(s)=  £{/(<)}•  Show  that 

/(0=-  je-'tnj)}. 

Thus,  if  we  know  how  to  invert  F'(s ),  then  we  can  also  invert  F(s). 

18.  Use  the  result  of  Problem  17  to  invert  each  of  the  following  Laplace  transforms 

(a)  ln(  Sf )  (b)  arc  tan  f  (c>  ln( 1  “  7 ) 


Solve  each  of  the  following  initial-value  problems  by  the  method  of 
Laplace  transforms. 

19.  y"  +y  =  sin/;  y(0)=  l,/(0)  =  2 

20.  y"+y  =  tsint;  y( 0)=  l,/(0)  =  2 

21.  y"-2y'+y  =  te‘;  y(0)  =  0, /(0)-0 

22.  y" -2y'  +  ly  =  sinf;  y(0)=0,/(0)  =  0 

23.  y"+/+y  =  l  +  ^-/;  y(0)  =  3, /(0)=  -5 


2.11  Differential  equations  with  discontinuous 
right-hand  sides 

In  many  applications,  the  right-hand  side  of  the  differential  equation  ay"  + 
by'  +  cy=f(t)  has  a  jump  discontinuity  at  one  or  more  points.  For  exam¬ 
ple,  a  particle  may  be  moving  under  the  influence  of  a  force  /j(/),  and 
suddenly,  at  time  tl9  an  additional  force  f2(t)  is  applied  to  the  particle.  Such 
equations  are  often  quite  tedious  and  cumbersome  to  solve,  using  the 
methods  developed  in  Sections  2.4  and  2.5.  In  this  section  we  show  how  to 
handle  such  problems  by  the  method  of  Laplace  transforms.  We  begin  by 
computing  the  Laplace  transform  of  several  simple  discontinuous  func¬ 
tions. 

The  simplest  example  of  a  function  with  a  single  jump  discontinuity  is 
the  function 


0<  t<c 
t>  c 


This  function,  whose  graph  is  given  in  Figure  1,  is  often  called  the  unit  step 
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function,  or  the  Heaviside  function.  Its  Laplace  transform  is 


£{//c(0}  =  /°°  e-«Hc{t)dt=  /V" 


dt 


ro 

=  lim  f  e~st dt=  lim 

A — >oo  ^ q  A — >oo 

p-cs 

= - ,  £>0. 


e-cs__e-sA 


Next,  let  /  be  any  function  defined  on  the  interval  0<  t<  oo,  and  let  g 
be  the  function  obtained  from  /  by  moving  the  graph  of  /  over  c  units  to 
the  right,  as  shown  in  Figure  2.  More  precisely,  g(0  =  0  for  0<  /<c,  and 
g(t)=f(t-c)  for  t>  c.  For  example,  if  c  =  2  then  the  value  of  g  at  /  =  7  is 
the  value  of  /  at  t  =  5.  A  convenient  analytical  expression  for  g(t)  is 

g(t)=Hc  (t)f(t -c). 


The  factor  Hc{t)  makes  g  zero  for  0<  t<  c,  and  replacing  the  argument  t 
of  /  by  t  —  c  moves  /  over  c  units  to  the  right.  Since  g(t)  is  obtained  in  a 
simple  manner  from  /(/),  we  would  expect  that  its  Laplace  transform  can 
also  be  obtained  in  a  simple  manner  from  the  Laplace  transform  of  f(t). 
This  is  indeed  the  case,  as  we  now  show. 
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Property  3.  Let  =  £  {/(7)}.  Then, 


Z{He(t)f(t-c))  =  e-“F(s). 


Proof.  By  definition, 


/•  00 

£  {Hc  (t)f(t -c)}  =  /  e~s,Hc(t)f(t  —  c)dt 

Jo 

r  oo 

=  J  e~stf(t  —  c)dt. 


This  integral  suggests  the  substitution 

i=t-c. 

Then, 


Jr  oo  r < 

e~s,f(t~c)dt=  I 

c  J0 


•*(£+<• 


)/(€)rf{ 


=  e~a  f°°e-*/(t)d( 

Jo 

=  e~csF(s). 

Hence,  £{Hc(t)f(t-  c)}  =  e~cs£{f(t)}.  □ 

Example  1.  What  function  has  Laplace  transform  e~s/s2ct 

Solution.  We  know  that  1/s 2  is  the  Laplace  transform  of  the  function  t. 

Hence,  by  Property  3 

^  =  £{#1  (')('-!)}• 

Sz 

The  graph  of  Hx(t)(t  —  1)  is  given  in  Figure  3. 


Figure  3.  Graph  of  Hx(t)  (t-  1) 
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Example  2.  What  function  has  Laplace  transform  e  3s/(s2  —  2s  —  3)? 
Solution.  Observe  that 

1  =  1  =  1 
s2  —  2s  —  3  s2  —  2s+l  —  4  (5  —  l)2  — 22 

Since  \/{s2  —  22)  =  £{^sinh2/},  we  conclude  from  Property  2  that 


1 


(s—  l)2  — 22 

Consequently,  from  Property  3, 


=  e 


1 

— e'sinh2/ 

2 


— 


s2  — 2^  — 3 


=  e 


^  ^//3  (/)e'  3sinh2(/ 


-3) 


Example  3.  Let  f(t)  be  the  function  which  is  t  for  0  <  t  <  1,  and  0  for  t  >  1. 
Find  the  Laplace  transform  of  /  without  performing  any  integrations. 
Solution.  Observe  that  /(/)  can  be  written  in  the  form 

/(0  =  4//o(0-^i(0]  =  '-^iW- 
Hence,  from  Property  1, 

£{/(0}  =  £{0-£{'tfi(')} 

_  1  |  d  e~s  _  1  e~s  e~s 

s2  ds  s  s 2  s  s2 

Example  4.  Solve  the  initial-value  problem 


dy 

3^+2 y-/(0- 


1,  0<  f  <  1; 

1,  2 <  f  <3; 
1 1,  4<  /<5; 


0,  1  <  t<2; 

0,  3  <  f  <4; 
0,  5  <  /  <  oo . 


y  (0) = o,  /(0)=0 


Solution.  Let  F(Y)=  £{y(7)}  and  F(s)  =  £{/(/)}.  Taking  Laplace  trans¬ 
forms  of  both  sides  of  the  differential  equation  gives  (s2  —  3.S  +  2)  Y(s)  = 
F(s ),  so  that 


Y(s) 


s2  —  3s +  2 


F{s) 

(s-l)(s-2)' 


One  way  of  computing  F(s )  is  to  write  f(t)  in  the  form 


fit)  =  [  h0  (t) -  hx  (0  ]  +  [  h2  (t) -  h3  (0  ]  +  [  ha  (0  -  h5  (t)  ]. 

Hence,  by  the  linearity  of  the  Laplace  transform 


F(s) 


1 

s 


—  s 


—  2s 


+ 


—  3s 


—  4s 


+ 


e 


—  5s 


S 
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A  second  way  of  computing  F(s )  is  to  evaluate  the  integral 
f  e~stf(t)dt=  f  e~stdt+  f  e~stdt+  f  e~st dt 

Jo  Jo  J2  J4 


l  —  e  s  e  2s  —  e  3s  +  e  4s  —  e  5s 


Consequently, 


Y(s)  = 


3 s  1  „-4 s  _  „-5s 


5(5  —  \)(s  —  2) 

Our  next  step  is  to  expand  1/5(5— 1)(5  — 2)  in  partial  fractions;  i.e.,  we 


write 


1 


-4  +  _^+  c 


5(5— 1)(5  — 2)  s  5  —  1  5  —  2 


This  implies  that 


A(s—  1)(5  —  2)  +  £5(5-2)+  Cs(s—  1)  =  1. 


(1) 


Setting  5  =  0  in  (1)  gives  A  =  \ ;  setting  5  =  1  gives  B  =  —  1 ;  and  setting  5  =  2 
gives  C=  Thus, 

1  _  1  1  1,11 
5(5-1)(j-2)  2  s  s-  1  2  s-2 


-eij 


1  2/ 


e‘+2e 


Consequently,  from  Property  3, 

+  H2  (t) [  i  -  e('-2)  +  ^2('"2) ]  -  H3  (/) l  \  -  e('_3)  +  ±e2(,_3)] 
+  -e<'-4)  +  ±e2<'-4>]  -  H5  (/)[£ -e<-5>  +  ie«‘-5>]. 


Remark.  It  is  easily  verified  that  the  function 

i  _  e0~n)+  ie2(t~n) 

and  its  derivative  are  both  zero  at  /  =  n.  Hence,  both y(t)  and y\t)  are  con¬ 
tinuous  functions  of  time,  even  though  /(/)  is  discontinuous  at  f  =  1,  2,  3,  4, 
and  5.  More  generally,  both  the  solution  y(t)  of  the  initial-value  problem 

a'tf+b%+cy=f(t^  ^'o)=*»  yVo)=y'o 
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and  its  derivative  y'(t)  are  always  continuous  functions  of  time,  if  /(f)  is 
piecewise  continuous.  We  will  indicate  the  proof  of  this  result  in  Section 


2.12. 


Exercises 

Find  the  solution  of  each  of  the  following  initial-value  problems. 


1.  y"+2/+>--2(f  — 3)//3(f);  y(0)=2,/(0)=  1 

2.  y"+y'+y  =  H„(t)-H2v(t);  y(0)  =  l,/(0)  =  0 


3.y"+4y={J>  °^4'<4;  y(0)  =  3,/(0)=  -2 
fsinf,  0<t<ir  ^(0)=i  ,/(0)=0 

l  COSt,  7T<t<  oo 


4.  y"+y  = 


5.  y"+y  = 


cos  t,  0</<7r/2 


0,  77-/2</<00 


;  y  (0)  =  3,  .y '(0)  =  —  1 


l-y'^y’+ly^^  ^0)“0>/(°)=0 


8.  y"+y  = 


0<t<l  .  y(0)  =  0,/(0)  =  0 
1  <  t  <  oo 


9.  y"  —  2y'+y=  < 


f  0, 

t, 

lo, 


0<  t <  1 

1  <  f<2  ;  y(0)  =  0,y'(0)=  1 

2  <  /<  oo 


10.  Find  the  Laplace  transform  of  |sinf|.  Hint:  Observe  that 

OO 

|sint|  =  sin/  +  2  2  Hn7T  (t)  sin(/  —  mr). 

n=  1 


11.  Solve  the  initial-value  problem  of  Example  4  by  the  method  of  judicious  guess¬ 
ing.  Hint:  Find  the  general  solution  of  the  differential  equation  in  each  of  the 
intervals  0  <  /  <  1,  1  <  /  <  2,  2  <  t  <3,  3  <  /  <  4,  4  <  /  <  5,  5  <  /  <  oo,  and 
choose  the  arbitrary  constants  so  that  y(t )  and  y\t)  are  continuous  at  the 
points  /=  1,  2,  3,  4,  and  5. 


2.12  The  Dirac  delta  function 

In  many  physical  and  biological  applications  we  are  often  confronted  with 
an  initial-value  problem 

2  i 

tf^  +  ^  +  cF=/(f);  y(0)=y0,  y'(0)=y'0  (1) 
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Figure  1.  The  graph  of  a  typical  impulsive  function  f(t ) 


where  we  do  not  know  f(t)  explicitly.  Such  problems  usually  arise  when  we 
are  dealing  with  phenomena  of  an  impulsive  nature.  In  these  situations,  the 
only  information  we  have  about  f(t)  is  that  it  is  identically  zero  except  for 
a  very  short  time  interval  /0<  t  <  tx,  and  that  its  integral  over  this  time  in¬ 
terval  is  a  given  number  I0=£ 0.  If  /0  is  not  very  small,  then  /(/)  will  be 
quite  large  in  the  interval  t0<t<t Such  functions  are  called  impulsive 
functions,  and  the  graph  of  a  typical  f(t)  is  given  in  Figure  1. 

In  the  early  1930’s  the  Nobel  Prize  winning  physicist  P.  A.  M.  Dirac  de¬ 
veloped  a  very  controversial  method  for  dealing  with  impulsive  functions. 
His  method  is  based  on  the  following  argument.  Let  tx  get  closer  and  closer 
to  t0.  Then  the  function  f(t)/I0  approaches  the  function  which  is  0  for  t 
t0 ,  and  oo  for  t  =  t0 ,  and  whose  integral  over  any  interval  containing  /0  is  1 . 
We  will  denote  this  function,  which  is  known  as  the  Dirac  delta  function, 
by  8(t- 10).  Of  course,  8(t- 10)  is  not  an  ordinary  function.  However,  says 
Dirac,  let  us  formally  operate  with  8(t  —  t0 )  as  if  it  really  were  an  ordinary 
function.  Then,  if  we  set  f(t)  =  I08(t  — 10)  in  (1)  and  impose  the  condition 

(bg(t)8(t-t0)dt={  g(to)  ifa<to (2) 

Ja  {  0  otherwise 

for  any  continuous  function  g(/),  we  will  always  obtain  the  correct  solution 
y(t). 


Remark.  Equation  (2)  is  certainly  a  very  reasonable  condition  to  impose 
on  8  {t  — 10).  To  see  this,  suppose  that  f(t)  is  an  impulsive  function  which  is 
positive  for  /0</</1?  zero  otherwise,  and  whose  integral  over  the  interval 
[/q,^]  is  1.  For  any  continuous  function  g(f), 


min  g(t)]f(t)< 

to  ^  t  ^  t  j 


max 


Consequently, 


min 


2(0]/(0<*<  g{t)f(t)dt<  f  T  max  g(t)]f(t)dt9 
J  Jt0  JtQ  lh<t<tx  J 
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or 

min  g(t)<  f‘x g(t)f(t)dt<  max  g(/). 

■'to  to<t<h 

Thus,  as  tx-*tQ,  f'  g(t)f(t)dt->g(t0). 

J  t0 

Now,  most  mathematicians,  of  course,  usually  ridiculed  this  method. 
“How  can  you  make  believe  that  8(t  — t0)  is  an  ordinary  function  if  it  is 
obviously  not,”  they  asked.  However,  they  never  laughed  too  loud  since 
Dirac  and  his  followers  always  obtained  the  right  answer.  In  the  late 
1940’s,  in  one  of  the  great  success  stories  of  mathematics,  the  French 
mathematician  Laurent  Schwartz  succeeded  in  placing  the  delta  function 
on  a  firm  mathematical  foundation.  He  accomplished  this  by  enlarging  the 
class  of  all  functions  so  as  to  include  the  delta  function.  In  this  section  we 
will  first  present  a  physical  justification  of  the  method  of  Dirac.  Then  we 
will  illustrate  how  to  solve  the  initial- value  problem  (1)  by  the  method  of 
Laplace  transforms.  Finally,  we  will  indicate  very  briefly  the  “germ”  of 
Laurent  Schwartz’s  brilliant  idea. 

Physical  justification  of  the  method  of  Dirac .  Newton’s  second  law  of  mo¬ 
tion  is  usually  written  in  the  form 

^ mv(t)=f(t )  (3) 

where  m  is  the  mass  of  the  particle,  v  is  its  velocity,  and  f(t)  is  the  total 
force  acting  on  the  particle.  The  quantity  mv  is  called  the  momentum  of 
the  particle.  Integrating  Equation  (3)  between  t0  and  tx  gives 

mv(tl)-mv(t0)=  f /lf(t)dt . 

Jt0 

This  equation  says  that  the  change  in  momentum  of  the  particle  from  time 

t0  to  time  tx  equals  J  1  f(t)dt.  Thus,  the  physically  important  quantity  is 

the  integral  of  the  force,  which  is  known  as  the  impulse  imparted  by  the 
force,  rather  than  the  force  itself.  Now,  we  may  assume  that  a>0  in  Equa¬ 
tion  (1),  for  otherwise  we  can  multiply  both  sides  of  the  equation  by  —  1  to 
obtain  a  >  0.  In  this  case  (see  Section  2.6)  we  can  view>>  (/),  for  t  <  /0,  as  the 
position  at  time  t  of  a  particle  of  mass  a  moving  under  the  influence  of  the 
force  —  b(dy/dt)  —  cy.  At  time  t0  a  force  f(t)  is  applied  to  the  particle,  and 
this  force  acts  over  an  extremely  short  time  interval  t0<t<tx.  Since  the 
time  interval  is  extremely  small,  we  may  assume  that  the  position  of  the 
particle  does  not  change  while  the  force  f(t)  acts.  Thus  the  sum  result  of 
the  impulsive  force  /(/)  is  that  the  velocity  of  the  particle  jumps  by  an 
amount  I0/ a  at  time  t0.  In  other  words,  y(t)  satisfies  the  initial-value  prob¬ 
lem 

a^  +  b%+cy=0’  y(0)~y»  /(o)-.vi 
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for  0  <  /  <  /0,  and 


a 


+b^+cy=0; 


, 

z0H - 

a 


for  t  >  t0,  where  z0  and  z'0  are  the  position  and  velocity  of  the  particle  just 
before  the  impulsive  force  acts.  It  is  clear,  therefore,  that  any  method 
which  correctly  takes  into  account  the  momentum  70  transferred  to  the 
particle  at  time  t0  by  the  impulsive  force  f{t)  must  yield  the  correct  answer. 
It  is  also  clear  that  we  always  keep  track  of  the  momentum  70  transferred 
to  the  particle  by  f(t)  if  we  replace /(/)  by  I0S(t  —  t0)  and  obey  Equation 
(2).  Hence  the  method  of  Dirac  will  always  yield  the  correct  answer. 


Remark.  We  can  now  understand  why  any  solution  y(t)  of  the  differential 
equation 


a 


d2y 
dt 2 


+  b±+Cy=f(t), 


f(t)  a  piecewise  continuous  function, 


is  a  continuous  function  of  time  even  though  f(t )  is  discontinuous.  To  wit, 
since  the  integral  of  a  piecewise  continuous  function  is  continuous,  we  see 
thaty'(0>  must  vary  continuously  with  time.  Consequently,  y(t)  must  also 
vary  continuously  with  time. 


Solution  of  Equation  (1)  by  the  method  of  Laplace  transforms.  In  order  to 
solve  the  initial-value  problem  (1)  by  the  method  of  Laplace  transforms, 
we  need  only  know  the  Laplace  transform  of  S  (t  — t0).  This  is  obtained  di¬ 
rectly  from  the  definition  of  the  Laplace  transform  and  Equation  (2),  for 


£{S(/  —  t0)j  =  f  e  sfS(t  — t0)dt  =  e  st°  (for/0>0). 

•'o 


Example  1.  Find  the  solution  of  the  initial-value  problem 

^-4^+4y  =  38(t-\)  +  8(t-2)-  y{ 0)  =  1,  /(0)  =  1. 

Solution.  Let  y(.s)=  £{^(/)).  Taking  Laplace  transforms  of  both  sides  of 
the  differential  equation  gives 

s2Y-s-\-4(sY-\)  +  4Y  =  3e~s  +  e-2s 
or 

(^2-45  +  4)y(ls)  =  ,s-3  +  3e-5  +  ^"2j. 

Consequently, 

v/  s-3  ,  3e~s  ,  e~2s 

Y  (s) - -  H - -  H - -  . 

(s-2f  (s-2)2  {s-2f 
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Now,  1  /(^  —  2)2  =  £{te2t}.  Hence, 


3e 


—  s 


2s 


+ 


(S-2f  (s-2)2 


=  £{3//,  l)e20-l)  +  H2(t)(t-2)e2U~2)}. 


To  invert  the  first  term  of  Y  (s),  observe  that 

— ~ 3  ■  =  — — 2 - 1 - =  £(e2t]-£(te2'] 

(s-2)2  (s-2)2  (s-2)2  *  *  ' 

Thus,  y(t)  =  (1  -  i)e2t  +  3Hx{t)(t -  l)e2«~  !>  +  - 2)e2(‘~2\ 


It  is  instructive  to  do  this  problem  the  long  way,  that  is,  to  find  y(/)  sep¬ 
arately  in  each  of  the  intervals  0<  t<  1,  1  <  t<2  and  2<  /  <  oo.  For  0<  t  < 
Uy(t)  satisfies  the  initial- value  problem 

d2y  dy 

"4^  +  4y  =  0;  y( 0)  =  1,  /(0)  =  1. 

The  characteristic  equation  of  this  differential  equation  is  r2  —  4r  +  4  =  0, 
whose  roots  are  rx  =  r2  =  2.  Hence,  any  solution  y{t)  must  be  of  the  form 
y(t)  =  (ax  +  a2t)e2t.  The  constants  ax  and  a2  are  determined  from  the  initial 
conditions 

1  =y(0)  =  ax  and  1  =y'{0)  =  2ax  +  a2. 

Hence,  ax  =  1,  a2—  -1  and.y(0  =  (l  —  i)e2t  for  0 <  / <  1.  Now  j>(1)  =  0  and 
y\\)  =  -  e 2.  At  time  t  =  1  the  derivative  of  y(t)  is  suddenly  increased  by  3. 
Consequently,  for  1  <  f<2,  y(t)  satisfies  the  initial- value  problem 

^ ~ATt  +4y=0;  /(l)=3-e2. 

Since  the  initial  conditions  are  given  at  t  =  1,  we  write  this  solution  in  the 
form y(t)  =  [bx  + b2(t—\)\e2(t~l)  (see  Exercise  1).  The  constants  bx  and  b2 
are  determined  from  the  initial  conditions 

0=7(1)  =  bx  and  3  —  e2=y'(l)  =  2bx  +  b2. 

Thus,  bx=  0,  b2  =  3  —  e2  and  y(t)  =  (3  —  e2)(t  —  X)e2(<t~x\  l</<2.  Now, 
y(2)  =  (3  —  e2)e2  and  '(2)  =  3(3  —  e2)e2.  At  time  /  =  2  the  derivative  of  7(f) 
is  suddenly  increased  by  1.  Consequently,  for  2<  t  <  co9  y(t)  satisfies  the 
initial-value  problem 

-4-—-  +47  =  0;  y(2)  =  e2(3  —  e2),  y'(2)=l+3e2(3-e2). 

Hence  7(/)  =  [c1  +  c2(/  —  2)]e2('_2).  The  constants  cx  and  c2  are  determined 
from  the  equations 

e2(3-e2)  =  cx  and  l+3e2(3-e2)  =  2cx  +  c2. 
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Thus, 

Cj  —  e2{3  —  e2),  c2=  1  +3e2(3  —  e2)  —  2e2(3-  e2)=  l  +  e2(3  —  e2) 

and  y(t)  =  [e2(3  -  e2)  +  (1  4-  e\3  -  e2))(t  -  l)\e2(<t~2\  t  >  2.  The  reader  should 
verify  that  this  expression  agrees  with  the  expression  obtained  for  >>(/)  by 
the  method  of  Laplace  transforms. 

Example  2.  A  particle  of  mass  1  is  attached  to  a  spring  dashpot  mechanism. 
The  stiffness  constant  of  the  spring  is  1  N /ft  and  the  drag  force  exerted  by 
the  dashpot  mechanism  on  the  particle  is  twice  its  velocity.  At  time  /  =  0, 
when  the  particle  is  at  rest,  an  external  force  e~l  is  applied  to  the  system.  At 
time  t  =  1,  an  additional  force  f(t)  of  very  short  duration  is  applied  to  the 
particle.  This  force  imparts  an  impulse  of  3  N-s  to  the  particle.  Find  the 
position  of  the  particle  at  any  time  t  greater  than  1 . 

Solution.  Let  y(t)  be  the  distance  of  the  particle  from  its  equilibrium 
position.  Then,  y(t )  satisfies  the  initial- value  problem 


+y  =  e-‘  +  38(t—  1); 


>’(0)  =  0,  /(0)  =  0. 


Let  Y(s)=  £ {>>(/)}.  Taking  Laplace  transforms  of  both  sides  of  the  dif¬ 
ferential  equation  gives 


(i2  +  25+l)r(j)=/ry+3e-J,  or  Y(s) 


3e~s 
(s  +  \f 


Since 


we  see  that 

y{t)=t~Y~  +3 Hx  (t){t- l)e~(,-I). 
Consequently, y(t)  =  \t2e~l  +  3(t—  l)e~('_I)  for  t>  1. 


We  conclude  this  section  with  a  very  brief  description  of  Laurent 
Schwartz’s  method  for  placing  the  delta  function  on  a  rigorous  mathemati¬ 
cal  foundation.  The  main  step  in  his  method  is  to  rethink  our  notion  of 
“function.”  In  Calculus,  we  are  taught  to  recognize  a  function  by  its  value 
at  each  time  t.  A  much  more  subtle  (and  much  more  difficult)  way  of  re¬ 
cognizing  a  function  is  by  what  it  does  to  other  functions.  More  precisely, 
let  /  be  a  piecewise  continuous  function  defined  for  -  oo  <  t  <  oo.  To  each 
function  </>  which  is  infinitely  often  differentiable  and  which  vanishes  for  |/ 
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sufficiently  large,  we  assign  a  number  K[<f>]  according  to  the  formula 

*l>]«  f00  <K0/(') du  (5) 

00 

As  the  notation  suggests,  K  is  an  operator  acting  on  functions.  However,  it 
differs  from  the  operators  introduced  previously  in  that  it  associates  a 
number,  rather  than  a  function,  with  <j>.  For  this  reason,  we  say  that  K[4>]  is 
a  functional,  rather  than  a  function.  Now,  observe  that  the  association  <#>-» 
AT[<#>]  is  a  linear  association,  since 

K[c^>l  +  c2^2]=  f  (cl<t>l  +  c2<j>2)(t)f(t)dt 

00 

=  cif  h(0f(0elt  +  c2  f  'hiOfiOd 

J—  00  00 

=  c\^[<l>\]  +c2K[<t>l\- 

Hence  every  piecewise  continuous  function  defines,  through  (5),  a  linear 
functional  on  the  space  of  all  infinitely  often  differentiable  functions  which 
vanish  for  |/|  sufficiently  large. 

Now  consider  the  functional  K[<j>]  defined  by  the  relation  A^[<#>]  =  <#>(/0). 
AT  is  a  linear  functional  since 

K[  c,<h  +  c2<j>2]  =  cl4>l  (t0)  +  c2<t>2(t0)  =  c,  ]  +  c2K[$2]  . 

To  mimic  (5),  we  write  K  symbolically  in  the  form 

*[*]- f  %(*)«(* -*<>)<#•  (6) 

j—  00 

In  this  sense,  8(t  — 10)  is  a  “generalized  function.”  It  is  important  to  realize 
though,  that  we  cannot  speak  of  the  value  of  8(t  —  t0)  at  any  time  t.  The 

only  meaningful  quantity  is  the  expression  /  °°  <j>(t)8(t  —  t0)dt,  and  we 

J—  00 

must  always  assign  the  value  <#>(/0)  to  this  expression. 

Admittedly,  it  is  very  difficult  to  think  of  a  function  in  terms  of  the  lin¬ 
ear  functional  (5)  that  it  induces.  The  advantage  to  this  way  of  thinking, 
though,  is  that  it  is  now  possible  to  assign  a  derivative  to  every  piecewise 
continuous  function  and  to  every  “generalized  function.”  To  wit,  suppose 
that/(0  is  a  differentiable  function.  Then  /'(O  induces  the  linear  func¬ 
tional 

*'[*]-  (7) 

00 

Integrating  by  parts  and  using  the  fact  that  </>(0  vanishes  for  |/|  sufficiently 
large,  we  see  that 

*'(>]=  r  [-*'(o  ]/(')<*-*[  -*']•  (8> 

00 
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Now,  notice  that  the  formula  K'[<$>\  =  K[ -</>']  makes  sense  even  if  f(t)  is 
not  differentiable.  This  motivates  the  following  definition. 

Definition.  To  every  linear  functional  AT[</>]  we  assign  the  new  linear  func¬ 
tional  K'[<f>]  by  the  formula  =  K [  —  </>'].  The  linear  functional  K'[<p] 
is  called  the  derivative  of  K[<f>]  since  if  K[<f>]  is  induced  by  a  differentia¬ 
ble  function  /(/)  then  AT'[</>]  is  induced  by  f(t). 

Finally,  we  observe  from  (8)  that  the  derivative  of  the  delta  function 
8(t  — t0)  is  the  linear  functional  which  assigns  to  each  function  <j>  the  num¬ 
ber  —  for  if  =  </>(/o)  ^en  K'[<t>]  =  K[ ~<t>]  =  o)-  TT*us, 

foo 

•  00 

for  all  differentiable  functions  <£(/). 


Exercises 

1.  Let  a  be  a  fixed  constant.  Show  that  every  solution  of  the  differential  equation 
( d2y  /dt2)  -I-  2  a(dy  /  dt)  +  a2y  =  0  can  be  written  in  the  form 

y(t)  =  [ci  +  c2(t  -  a)]e~~a(‘~a\ 

2.  Solve  the  initial-value  problem  ( d2y / dt2) +  4 {dy / dt) -I-  5 y  =f(t);  y(0)  =  1,  y'(0)  = 
0,  where  f(t)  is  an  impulsive  force  which  acts  on  the  extremely  short  time  inter¬ 
val  1  <  /  <  1  +  t,  and  +  Tf(t)dt  =  2. 

3.  (a)  Solve  the  initial-value  problem  (d2y / dt2)-3(dy / dt)  +  2y  =  /(/);  y(0)=l, 

where  f(t)  is  an  impulsive  function  which  acts  on  the  extremely 

/•2  +  t 

short  time  interval  2  <  /  <  2  +  t,  and  /  f(t)dt=  -  1 . 

(b)  Solve  the  initial-value  problem  (d2y / dt2)-3(dy / dt)  +  2y  =0;  y(0)=  1,  y'(0) 
=  0,  on  the  interval  0<  f<2.  Compute  z0=y(2)  and  z'0=y'(2).  Then  solve 
the  initial-value  problem 

~3^  +2y=0;  y(2)  =  z0,  y'( 2)  =  z’0-l ,  2</<oo. 

Compare  this  solution  with  the  solution  of  part  (a). 

4.  A  particle  of  mass  1  is  attached  to  a  spring  dashpot  mechanism.  The  stiffness 
constant  of  the  spring  is  3  N /m  and  the  drag  force  exerted  on  the  particle  by  the 
dashpot  mechanism  is  4  times  its  velocity.  At  time  t  =  0,  the  particle  is  stretched  \ 
m  from  its  equilibrium  position.  At  time  t  =  3  seconds,  an  impulsive  force  of  very 
short  duration  is  applied  to  the  system.  This  force  imparts  an  impulse  of  2  N  •  s  to 
the  particle.  Find  the  displacement  of  the  particle  from  its  equilibrium  position. 
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In  Exercises  5-7  solve  the  given  initial-value  problem. 

5.  +>>  =  sin/  +  5(/  — 7r);  y(0)=0,  /(0)=0 

dt 2 


6.  ^  +  ^+y=28(t-l)-8(t-2y  y(0)=  l,/(0)=0 

7.  fl+2^+y  =  e-‘  +  3S(t-3y,  .y(0)=0,/(0)  =  3 

8.  (a)  Solve  the  initial-value  problem 


2  S(t-jir), 


j- o 


/(0)=/(0)  =  0, 


and  show  that 


T 


n  even 
n  odd 


in  the  interval  rm <t <(n  +  1)tt. 
(b)  Solve  the  initial-value  problem 


d2y 

dt2 


+y=  2  8(t-2jir), 


7  =  0 


r(0)=/(0)  =  0, 


and  show  that  y  (/)  =  («  +  l)sinf  in  the  interval  2rm <t  <2(n  +  1)tt. 

This  example  indicates  why  soldiers  are  instructed  to  break  cadence  when 
marching  across  a  bridge.  To  wit,  if  the  soldiers  are  in  step  with  the  natural 
frequency  of  the  steel  in  the  bridge,  then  a  resonance  situation  of  the  type  (b) 
may  be  set  up. 

9.  Let  /(/)  be  the  function  which  is  \  for  /  >  t0,  0  for  /  =  /0,  and  -  \  for  /  <  /0.  Let 
K[<j>]  be  the  linear  functional 

*[<#>]=  f°°  <K0f(t)dt. 

j—  oo 

Show  that  K'[(j>]  =  —  =  Thus,  8(t  —  t0)  may  be  viewed  as  the  deriva¬ 

tive  of  /(/). 


2.13  The  convolution  integral 

Consider  the  initial-value  problem 

a^+b~Jt+cy=-f^;  y(Q)=yo>  y'(Q)=y<>-  0) 

Let  Y (s)  =  &{y(t)}  and  F{s)  =  £{/(/)}.  Taking  Laplace  transforms  of 
both  sides  of  the  differential  equation  gives 

a[s2Y  (j)  -  5/c-^o]  +  b[  jr  0) -y0\  +  cY  (s)  =  F(s) 
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and  this  implies  that 


v/  \  as  +  b  ,  a  ,  ,  F(s) 
Y(s)  =  , .  ^0+ . y.'— ■■■Vo+ 


a^  +  fo  +  c  g.s2  +  &y4-c  ay2  4- to  4*  c 


Now,  let 


_  p-i 


j,(?)=e 


and 


—  p-i 


g.s4-  b 

k  gs2  +  to  4-  c 


a 

l  g^2  +  to  4-  c 


Setting /(/)  =  0,y0  =  1  and^o  =  0,  we  see  thaty^/)  is  the  solution  of  the  ho¬ 
mogeneous  equation  which  satisfies  the  initial  conditions  >^(0)  =  l,/.(0)  = 
0.  Similarly,  by  setting  f(t)  =  0,  yo  =  0  and  y'0=  1,  we  see  that  y2(t)  is  the 
solution  of  the  homogeneous  equation  which  satisfies  the  initial  conditions 
y2(0)  =  0,  y'2(0)  =  1.  This  implies  that 


<K0=£_1 


F(s) 


as2+  bs  +  c 


is  the  particular  solution  of  the  nonhomogeneous  equation  which  satisfies 
the  initial  conditions  i//(0)  =  0,  \p'( 0)  =  0.  Thus,  the  problem  of  finding  a  par¬ 
ticular  solution  x//(t)  of  the  nonhomogeneous  equation  is  now  reduced  to 
the  problem  of  finding  the  inverse  Laplace  transform  of  the  function 
F(s)/(as2  +  bs  +  c).  If  we  look  carefully  at  this  function,  we  see  that  it  is 
the  product  of  two  Laplace  transforms;  that  is 


F(s) 

as 2  4 bs  +  c 


=  £{/(0}x£ 


yi(0 

a 


It  is  natural  to  ask  whether  there  is  any  simple  relationship  between  \p(t) 
and  the  functions  f(t)  axidy2{t)/ a.  It  would  be  nice,  of  course,  if  x//(t)  were 
the  product  of  f(t)  withy2(/)/g,  but  this  is  obviously  false.  However,  there 
is  an  extremely  interesting  way  of  combining  two  functions  /  and  g 
together  to  form  a  new  function  f*g ,  which  resembles  multiplication,  and 
for  which 


£{(/**)(')} -£{/(')}  ><£{*(')}• 

This  combination  of  /  and  g  appears  quite  often  in  applications,  and  is 
known  as  the  convolution  of  /  with  g. 

Definition.  The  convolution  ( f*g)(t )  of  /  with  g  is  defined  by  the  equation 

(f*g)(t)=  f  f(t-u) g(u)du.  (2) 

Jo 
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2 

For  example,  if  /(7)  =  sin2/  and  g(t)  =  e* ,  then 

(f*g)(t)=  f  sin2(/  —  u)eu2du. 

The  convolution  operator  *  clearly  bears  some  resemblance  to  the  mul¬ 
tiplication  operator  since  we  multiply  the  value  of  /  at  the  point  t  —  u  by 
the  value  of  g  at  the  point  u ,  and  then  integrate  this  product  with  respect  to 
u.  Therefore,  it  should  not  be  too  surprising  to  us  that  the  convolution  op¬ 
erator  satisfies  the  following  properties. 

Property  1.  The  convolution  operator  obeys  the  commutative  law  of  multi¬ 
plication;  that  is,  (/*g)(0  =  (g*/)(0- 

Proof.  By  definition, 

(/*g)(0=  f  f(t-u)g(u)du . 

Jo 

Let  us  make  the  substitution  t  —  u  —  s  in  this  integral.  Then, 

(f*g)(0=-f  f(s)g(t-s)ds 

Jt 

=  f  g(t-s)f(s)ds=(g*f)(t).  □ 

Jo 

Property  2.  The  convolution  operator  satisfies  the  distributive  law  of  mul¬ 
tiplication;  that  is, 

/*(g  +  /0=/*g +/*/*. 


Proof.  See  Exercise  19.  □ 

Property  3.  The  convolution  operator  satisfies  the  associative  law  of  multi¬ 
plication;  that  is,  ( f*g)*h=f*(g*h ). 

Proof.  See  Exercise  20.  □ 

Property  4.  The  convolution  of  any  function  /  with  the  zero  function  is 
zero. 

Proof.  Obvious.  □ 

On  the  other  hand,  the  convolution  operator  differs  from  the  multiplica¬ 
tion  operator  in  that  /*1^/  and  /*/t^/2.  Indeed,  the  convolution  of  a 
function  /  with  itself  may  even  be  negative. 


Example  1.  Compute  the  convolution  of  f(t)  =  t2  with  g(t)=  1. 
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Solution .  From  Property  1 


(f*g)(0  =  (g*f)(0  =  fQ  l-u2du=j. 


Example  2.  Compute  the  convolution  of  f(t)  =  cost  with  itself,  and  show 
that  it  is  not  always  positive. 

Solution.  By  definition, 


(/*/)(/)  =  j  cos(t  — u)  cos  udu 

=  j  (costcos2u  +  sintsinucosu)du 


.  f 1 1  +cos2w  ,  ,  •  .  Cl  .  , 

=  cost  I  - - - du  +  smt  I  smucosudu 

J0  2  Jn 


t  ,  sin 2/  ,  sm 3t 

=  COS  t  ^-  +  — —  +  ~^— 

2  4  J  2 

t  cos  t  +  sin  t  cos2 1  +  sin3  / 


t  cos  /  +  sin  t  (cos2 1  +  sin2  /) 

=  2 
_  fcos/  +  sku 
2 

This  function,  clearly,  is  negative  for 

(2 n+  1)77  <  t  <(2 n+  1)77+^77,  ^  =  0, 1,2, ... . 

We  now  show  that  the  Laplace  transform  of  f*g  is  the  product  of  the 
Laplace  transform  of  /  with  the  Laplace  transform  of  g. 

Theorem  9.  £{(/*g)(0)  =  £{/( 0}  X  £{  g(t)}. 

Proof.  By  definition, 


£{(/**)(')} 


‘  I ''' 


f(t-u)g(u)du  \dt 


This  iterated  integral  equals  the  double  integral 

/  /  e~s'f{t-u)g{u)dudt 


where  R  is  the  triangular  region  described  in  Figure  1.  Integrating  first 


254 


2.13  The  convolution  integral 


Figure  1 


with  respect  to  /,  instead  of  u,  gives 


/•OO  r 

£{(/*s)(0}=  I  g(“)  f  e~s‘f(t-u)dt  du. 

Jq  l  U 


Setting  /  —  w=£,  we  see  that 


Hence, 


f°°e~*'f(t  —  u)dt=  f°°e~s(u+()f($)d$. 

Ju  Jo 

XOO  r 

g(«)  jf  e~sue~sif(^)di  du 

=  [X  g(u)e  SUdu 

=  £{/(,)}  x  £{*(,)}. 


Example  3.  Find  the  inverse  Laplace  transform  of  the  function 


s2(s2  +  a2) 


Solution.  Observe  that 


-r=£{/}  and  ~-a  =  £{sin  at). 


s2  +  a2 


Hence,  by  Theorem  9 


— ^ — —  |=  f  (t  —  u)s\naudu 
s2{s2  +  a2)  J  Jo 


at  —  sin  at 
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Example  4.  Find  the  inverse  Laplace  transform  of  the  function 

1 


s(s2  +  2s  +  2) 


Solution.  Observe  that 
-j-  =  £{l}  and 

Hence,  by  Theorem  9, 


+  2J  +  2  (*+i)2+i 


=  £{e  'sin/}. 


£ 


-i 


1 


[  s(s2  +  2s  +  2)  J 


=  f  e  “sin udu 
=  ~  £  1  —  e~t (cos/  +  sin/)]. 


Remark.  Let  y2(/)  be  the  solution  of  the  homogeneous  equation  ay"  +  by' 
-hey  =  0  which  satisfies  the  initial  conditions y 2(0)  =  0, y 2(0)  =  1 .  Then, 

y2(t) 

*(0-/(0*-^  (3) 


a 


is  the  particular  solution  of  the  nonhomogeneous  equation  ay "  +  by'  +  cy  = 
/(/)  which  satisfies  the  initial  conditions  \p(0)  =  \///(0)  =  0.  Equation  (3)  is 
often  much  simpler  to  use  than  the  variation  of  parameters  formula  de¬ 
rived  in  Section  2.4. 


Exercises 

Compute  the  convolution  of  each  of  the  following  pairs  of  functions. 


1.  eat,  ebt ,  a=£b 
3.  cos  at,  cos  bt 
5.  sin  a/,  sin  at 


2.  eat ,  eat 
4.  sin  at,  sin  bt,  a^=b 
6.  /,  sin  / 


Use  Theorem  9  to  invert  each  of  the  following  Laplace  transforms. 

1  o  S  a  5 


7. 


10. 


s2(s2+  1) 
1 

j(.y2+  1) 


8. 


11. 


(s+\)(s2  +  4) 
1 

l)2 


9. 


12. 


(*2+iy 

i 

(*2+D: 


Use  Theorem  9  to  find  the  solution  y(t)  of  each  of  the  following  integro- 
differential  equations. 


13.  y(t)  =  4t  —  3  f  y(u)sin(t  —  u)du 

14.  y(t)  =  4t  —  3  f  ly(t  —  u)sinudu 


256 


2.14  The  method  of  elimination  for  systems 


15.  y'(t)  =  sint+  [ty(t—  u)cosudu,  y(0)  =  0 

Jo 

16.  y(t)  =  4t2—  f  y(u)e~^~u^du 

Jo 

17.  y\t)  +  2y+  f  ty(u)du  =  sint,  y(0)  =  1 

Jo 

18.  y{t)—t  —  et  [ty(u)e~u du 

Jo 

19.  Prove  that /*(g  +  h)=f*g+f*h. 

20.  Prove  that  (f*g)*h=f*(g*h). 


2.14  The  method  of  elimination  for  systems 

The  theory  of  second-order  linear  differential  equations  can  also  oe  used 
to  find  the  solutions  of  two  simultaneous  first-order  equations  of  the  form 


•*' = =  a0)x + b(*)y  +f(0 

y'  =  ^  =  c(t)x  +  d(t)y  +  g(t). 


(0 


The  key  idea  is  to  eliminate  one  of  the  variables,  say^y,  and  then  find  x  as 
the  solution  of  a  second-order  linear  differential  equation.  This  technique 
is  known  as  the  method  of  elimination ,  and  we  illustrate  it  with  the  follow¬ 
ing  two  examples. 


Example  1.  Find  all  solutions  of  the  simultaneous  equations 

x'  =  2x+y  + 1 
y'  =  x  +  3y  +  1. 

Solution.  First,  we  solve  for 

y  —  x'  —  2x  —  t 

from  the  first  equation  of  (2).  Differentiating  this  equation  gives 

y'  =  x"  —  2x'  —  1  =  x  +  3y  +  1. 

Then,  substituting  for  y  from  (3)  gives 

x"  —  2x'  —  1  =  x  4-  3(x'  —  2x  —  t)  +  1 

so  that 

x"-5x'  +  5x  =  2-3t. 

Equation  (4)  is  a  second-order  linear  equation  and  its  solution  is 

V5,/2l  0+30 


x(t)  =  e5‘/2  cxe^’,l2  +  c2e  ^ ‘/2 


(2) 


(3) 


(4) 
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for  some  constants  cx  and  c2.  Finally,  plugging  this  expression  into  (3)  gives 


y(t)  =  e5t f2 


1  +  V5 


cxe 


Vs  t/2 


+ 


1-V5 


c2e 


-Vs  t/2 


+ 


t-\ 


Example  2.  Find  the  solution  of  the  initial-value  problem 

x'  =  3x  —y,  x(0)  =  3 

y’  =  x+y,  y(  0)  =  0. 

Solution.  From  the  first  equation  of  (5), 

y  =  3x  —  x'. 

Differentiating  this  equation  gives 

y'  =  3x'  —  x"  =  x+y. 

Then,  substituting  for  y  from  (6)  gives 


3x'  —  x"  =  x  +  3x  —  x' 


so  that 


x"-4x'  +  4jc  =  0. 

This  implies  that 

x(t)  =  (cx  +  c2t)e2t 

for  some  constants  c,,  c2,  and  plugging  this  expression  into  (6)  gives 

y(t)  =  (cl-c2+c2t)e21. 


The  constants  cx  and  c2  are  determined  from  the  initial  conditions 

x(0)  =  3  =  cx 
y(0)  =  0  =  c1  —  c2. 

Hence  cx  =  3,  c2  =  3  and 

x(^)  =  3(l  +  t)e2t,y(t)=s3te 2t 

is  the  solution  of  (5). 


(5) 

(6) 


Remark.  The  simultaneous  equations  (1)  are  usually  referred  to  as  a  first- 
order  system  of  equations.  Systems  of  equations  are  treated  fully  in 
Chapters  3  and  4. 


Exercises 

Find  all  solutions  of  each  of  the  following  systems  of  equations. 

1.  — 3 y  2.  x'=  —  2x+y  + 1 

y'  =  2x+y  y'=  —  4x  +  3y-  1 
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3.  x'=  —  3x  +  2y  4.  x'  =  x+y  +  e‘ 

y’^-x-y  y’  =  x-y-e‘ 

Find  the  solution  of  each  of  the  following  initial-value  problems. 


5.  x’-x+y,  x(0)  =  2 
y'=4x+y,  y(0)=3 

7.  x'=x-y,  x(0)=l 

y’  =  5x-3y,  y( 0)  =  2 

9.  x'  =*4x  +  5y+4e‘ cost,  x(0)=0 
y'=  -2x-2y,  y(0)=0 

11.  x'=2x  —  5>>  +  sin/,  x(0)=0 
y'=x  —  2y  +  tant,  y(0)=0 


6.  x'=x  —  3y,  x(0)=0 

y'—  —2x  +  2y,  y(0)  =  5 

8.  x'  =  3x  —  2y,  x(0)=l 
y'  =  4x—y,  y(0)  =  5 

10.  x'  =  3x— 4 y  +  e1,  x(0)=l 
y'=x-y  +  e‘,  y( 0)=1 

12.  x'=y +/,(»,  x(0)=0 

y'=-x+M0,  y(0)—0 


2.15  Higher-order  equations 

In  this  section  we  briefly  discuss  higher-order  linear  differential  equations. 
Definition.  The  equation 

L[y]=a»(0^+^n-i0)^rr  +  —  +ao(0y-o,  an(t)^o  (i) 

is  called  the  general  nth  order  homogeneous  linear  equation.  The  dif¬ 
ferential  equation  (1)  together  with  the  initial  conditions 

y(to)=yo>  yVo)=yo>--->y(n~l)(to)=yo(n-')  O') 

is  called  an  initial-value  problem.  The  theory  for  Equation  (1)  is  com¬ 
pletely  analogous  to  the  theory  for  the  second-order  linear  homoge¬ 
neous  equation  which  we  studied  in  Sections  2.1  and  2.2.  Therefore,  we 
will  state  the  relevant  theorems  without  proof.  Complete  proofs  can  be 
obtained  by  generalizing  the  methods  used  in  Sections  2. 1  and  2.2,  or  by 
using  the  methods  to  be  developed  in  Chapter  3. 

Theorem  10.  Let  yft), . . .  ,yn(t)  be  n  independent  solutions  of  (1);  that  is ,  no 
solution  yj(t)  is  a  linear  combination  of  the  other  solutions.  Then ,  every 
solution  y(t)  of  (1)  is  of  the  form 

y{t)  =  C\y\(t)+  ...  +  c„yn(t)  (2) 

for  some  choice  of  constants  cu...,cn.  For  this  reason ,  we  say  that  (2)  is 
the  general  solution  of  { 1). 

To  find  n  independent  solutions  of  (1)  when  the  coefficients  aQ ,  ax,...,an 
do  not  depend  on  t ,  we  compute 

L[er‘]  =  (anr"  +  an-1rn~1+  ...+aQ)en. 


(3) 
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This  implies  that  ert  is  a  solution  of  (1)  if,  and  only  if,  r  is  a  root  of  the 
characteristic  equation 

anrn  +  an-\rn~l+  ...  +  tfo  =  0.  (4) 

Thus,  if  Equation  (4)  has  n  distinct  roots  rl9 . . . , rn,  then  the  general  solution 
of  (1)  is  y(t)=  cxerxt  +  ...  +cnernt.  If  ^  =  ay.  +  //?y  is  a  complex  root  of  (4), 
then 

u(t)  =  Re{er;' }  =ea'/ cos  fyt 

and 

v{t)  =  lm{erjt }  =ea''sin fy-t 

are  two  real-valued  solutions  of  (1).  Finally,  if  rx  is  a  root  of  multiplicity  k\ 
that  is,  if 

a„rn+  ...  +a0=(r-rl)kq(r) 

where  q(r{)*£ 0,  then  erit,  terx\...,tk~xerit  are  k  independent  solutions  of 
(1).  We  prove  this  last  assertion  in  the  following  manner.  Observe  from  (3) 
that 

L[er‘]  =  (r-ri)kq(r)en 
if  r,  is  a  root  of  multiplicity  k.  Therefore, 


L[tjer<‘]=L 


Ji 

drJ 


,rt 


r-r  1 


r  =  r. 


=  -^-(r-ri)k<j(r)er' 
=  0,  for  1  <  j  <  k. 


r— r, 


Example  1.  Find  the  general  solution  of  the  equation 

d4y 


dt 


+y=0. 


Solution.  The  characteristic  equation  of  (5)  is  r4+  1  =0.  We  find  the  roots 
of  this  equation  by  noting  that 


Hence, 


-\=e™=  =  eSfri  =  e7«i' 


r\ 


*3 


_  xu  /  4  77  ,  77  1 

=  p  /  =  ms  — 


cos  -J  +  i  sin  -7  = 

4  4  V2 

_  3  •jri  /  4 _  3  77  1  •  •  3  77 

=  e  '  —  cos  —  +  1  sin  —  =  — 

4  4 

=  e  /=cos  —  +  *  sin  —  =  - 

4  4 


(i  +  0> 


(i-0. 
(i  +  0> 
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and 

r4  =  e7™/4  =  cos  +  /  sin  =  — —  ( 1  —  /) 

4  4  V2 

are  4  roots  of  the  equation  r4  + 1  =  0.  The  roots  r3  and  r4  are  the  complex 
conjugates  of  r2  and  rx,  respectively.  Thus, 


r\t  —  pt/^2 


COS 


t  I  •  ’  t 

+  ism 


V2 


V2 


and 


r2t  —  p-t/'ft 


COS 


t  ,  .  .  t 
+ 1  sin 


V2 


V2  J 


are  2  complex-valued  solutions  of  (5),  and  this  implies  that 
y j (/)  =  cos  — — ,  y2(/)  =  sin  — — , 

V2  V2 

y3(/)  =  e~1^  cos  — — ,  and  y4(/)  =  sin  — — 

V  '  V2  W  V2 


are  4  real-valued  solutions  of  (5).  These  solutions  are  clearly  independent. 
Hence,  the  general  solution  of  (5)  is 


y(t)  =  e‘/V* 


t  ,  ,  .  t 

ax  cos  — —  +  Z^sin 


+  e 


t/V2 


V2  '  V2 


t  .  ,  .  f 

a2cos  — —  4-  o,sm 


V2 


V2 


Example  2.  Find  the  general  solution  of  the  equation 

d*y  d3y  d2y  dy 

dt 4  dt2  dt2  dt  U- 


Solution .  The  characteristic  equation  of  (6)  is 

0=  r4  —  3r3  +  3r2—  r(/*3  — 3r2  +  3r  —  1) 

=  r(r—  l)3. 

Its  roots  are  ^  =0  and  r2  =  1,  with  r2=l  a  root  of  multiplicity  three.  Hence, 
the  general  solution  of  (6)  is 

y  (J)  =  cx  +  (c2  +  c3t  +  c4t2)e\ 

The  theory  for  the  nonhomogeneous  equation 

L[y]  =  <*n(t)^jp;  +  ■■■+a0(t)y=f(t),  an{t)^  0  (7) 
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is  also  completely  analogous  to  the  theory  for  the  second-order  nonhomo- 
geneous  equation.  The  following  results  are  the  analogs  of  Lemma  1  and 
Theorem  5  of  Section  2.3. 


Lemma  1.  The  difference  of  any  two  solutions  of  the  nonhomogeneous  equa¬ 
tion  (7)  is  a  solution  of  the  homogeneous  equation  (1). 

Theorem  11.  Let  xf/(t)  be  a  particular  solution  of  the  nonhomogeneous  equa¬ 
tion  (7),  and  let  y  ft),..., yn(t)  be  n  independent  solutions  of  the  homoge¬ 
neous  equation  (1).  Then,  every  solution  y(t)  of  (7)  is  of  the  form 

J'(0=s,H0  +  cl>'l(0+  •••  +CflJn(0 
for  some  choice  of  constants  cx,  c2,---9cn. 

The  method  of  judicious  guessing  also  applies  to  the  Azth-order  equation 

an~dF  +  +aoy  =  [bo+b\t  +  •••  +bktk]eat.  (8) 

It  is  easily  verified  that  Equation  (8)  has  a  particular  solution  \p(t)  of  the 
form 

XP(0  =  [C0+Cl*+  **•  +CA :tk]e<Xt 
if  eat  is  not  a  solution  of  the  homogeneous  equation,  and 

if'(t)  —  tJ'[ £q+  cxt 4- ...  4-  ckt k~^eat 

if  tJ~leat  is  a  solution  of  the  homogeneous  equation,  but  tJeat  is  not. 


Example  3.  Find  a  particular  solution  \p(t)  of  the  equation 

rr  i  _  d3y  ,  d2y  , ,  4y  ,  _  , 

z'[-v]  =  ~+3~+3~+-y=:e- 


dt' 


dt 


dt 


Solution.  The  characteristic  equation 

r3  +  3r2  +  3r+  1  =(r+ 1)3 

has  r=  —  1  as  a  triple  root.  Hence,  el  is  not  a  solution  of  the  homogeneous 
equation,  and  Equation  (9)  has  a  particular  solution  x//( t )  of  the  form 

Computing  L[i//](7)  =  8/le',  we  see  that  A  =  \.  Consequently,  \p(t)  =  \el  is  a 
particular  solution  of  (9). 

There  is  also  a  variation  of  parameters  formula  for  the  nonhomoge¬ 
neous  equation  (7).  Let  v{t)  be  the  solution  of  the  homogeneous  equation 
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(1)  which  satisfies  the  initial  conditions  v(t0)  —  0,  v'(t0)  =  0, . . . , v(n  2\t0)  =  0, 

Then, 

+»-/  w/w* 

*/'o 

is  a  particular  solution  of  the  nonhomogeneous  equation  (7).  We  will  prove 
this  assertion  in  Section  3.12.  (This  can  also  be  proven  using  the  method  of 
Laplace  transforms;  see  Section  2.13.) 


Exercises 

Find  the  general  solution  of  each  of  the  following  equations. 
1.  /"  -  2y"  -y  +  2y  =  0  2.  /"  -  6/'  +  5/  +  \2y  =  0 

3.  y (iv) - 5/"  +  6/'  +  4/  -  8y  =  0  4.  /"-/'+/ -y=0 


Solve  each  of  the  following  initial-value  problems. 


5.  y(iv)  +  4/"  +  14 /'  -  20/  +  25y  =  0;  >>(0)  =/(0)  =/'( 0)  =  0,  /"(0)  =  0 

6.  /iv> -y  =  0;  y( 0)  =  1,  /( 0)  =/'( 0)  =  0,  /"(0)  =  -  1 


7.  /v) - 2/iv)  +/"  =  0;  >>(0)  =/(0)  =/'(0)  =/"( 0)  =  0,  /iv)(0)  =  - 1 

8.  Given  that yf^  —  e1  cos/  is  a  solution  of 

/*)  -  2/"  +y"  +  2/  -  2y  =  0,  (*) 

find  the  general  solution  of  (*).  Hint:  Use  this  information  to  find  the  roots  of 
the  characteristic  equation  of  (*). 

Find  a  particular  solution  of  each  of  the  following  equations. 

9.  /"  +/  =  tant  10.  y(lv)—  y  =  g(t) 


11.  y(iv)+y  =  g(t) 

13.  /"— 4/  =  /  +  cos/  +  2e“2/ 
15.  y (lv)  +  2y"  +y  =  t2  sin  t 


12.  /"+/  =  2/2  +  4sin/ 
14.  y  (lv)  —  y  —  t  +  sin  t 
16.  /vi>+/'  =  *2 


17.  /"  +y  "  +/ +y  =  t  +  e “ 1 r  18.  y (iv)  +  4 /"  +  6y  "  +  4/  ■ +y  =  t3e  ~ ' 

Hint  for  (18):  Make  the  substitution  =  and  solve  for  v.  Otherwise,  it  will 
take  an  awfully  long  time  to  do  this  problem. 
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Systems  of  differential  equations 


3.1  Algebraic  properties  of  solutions  of  linear  systems 

In  this  chapter  we  will  consider  simultaneous  first-order  differential  equa¬ 
tions  in  several  variables,  that  is,  equations  of  the  form 

dx  j 

dx  2 

—  fi  •  •  •  >•*/*)>  (0 


dxn 

dt 


A  solution  of  (1)  is  n  functions  such  that  dxj{t)/dt  — 

fj(t9xl(t),...9xn(t))9  j=  1,2, For  example,  xx{t)  —  t  and  x2(t)—t2  is  a 
solution  of  the  simultaneous  first-order  differential  equations 

dx}  dxj 

-z-  =  1  and  -3—  =2x, 
dt  dt  1 


since  dxx(t)/dt=  1  and  dx2(t)/ dt  =  2t  =  2xx(t). 

In  addition  to  Equation  (1),  we  will  often  impose  initial  conditions  on 
the  functions  xl(t),...9xn(t).  These  will  be  of  the  form 


=  x2(t0)  =  xl...,xn(t0)  =  x°.  (r) 

Equation  (1),  together  with  the  initial  conditions  (1)',  is  referred  to  as  an 
initial-value  problem.  A  solution  of  this  initial-value  problem  is  n  functions 
xx{t)9...9xn(f)  which  satisfy  (1)  and  the  initial  conditions 

For  example,  =  and  x2(t)=l  +  e2t /2  is  a  solution  of  the  initial- 
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value  problem 


dxx 


dx2 

- =xz 


x2(0)  = 


dt  ""v“'  2’ 

since  dxx{t)  / dt  =  e'  =  xx(t),  dx2{t)/ dt  —  e2'  =  x^(t),  x,(0)=  1  and  x2(0)= 
Equation  (1)  is  usually  referred  to  as  a  system  of  n  first-order  differen¬ 
tial  equations.  Equations  of  this  type  arise  quite  often  in  biological  and 
physical  applications  and  frequently  describe  very  complicated  systems 
since  the  rate  of  change  of  the  variable  Xj  depends  not  only  on  t  and  Xj,  but 
on  the  value  of  all  the  other  variables  as  well.  One  particular  example  is  the 
blood  glucose  model  we  studied  in  Section  2.7.  In  this  model,  the  rates  of 
change  of  g  and  h  (respectively,  the  deviations  of  the  blood  glucose  and  net 
hormonal  concentrations  from  their  optimal  values)  are  given  by  the  equa¬ 
tions 


dg 

dt 


=  —  mxg  —  m2h  +  J  (/), 


dh 

dt 


=  —  m3h  +  m4g. 


This  is  a  system  of  two  first-order  equations  for  the  functions  g(t)  and  h{t). 

First-order  systems  of  differential  equations  also  arise  from  higher-order 
equations  for  a  single  variable  y(t).  Every  wth-order  differential  equation 
for  the  single  variable  y  can  be  converted  into  a  system  of  n  first-order 
equations  for  the  variables 


in-  1 


y 


,  s.  ,  v  dy  d 

Examples  1  and  2  illustrate  how  this  works. 


Example  1.  Convert  the  differential  equation 

dny  dn ~ V 

a„(t)-^pr+a„- iM^— f  +  ■■■+%  y  =  0 

into  a  system  of  n  first-order  equations. 

Solution.  Let  xx{t)—y,  x2{t)  =  dy  /  dt,...,  and  xn(t)  =  dn~ly / dtn~x.  Then, 


dxx 


dx • 


dt  *2’  dt 


—  X' 


dxn- 1 


and 


dx 


n 


dt 


dt 

an-i(t)x„  +  an-.2(t)xn-i+  ■■■+aox\ 

a„{t) 


Example  2.  Convert  the  initial-value  problem 
d\  I  dy  \2 

d^+i^J  ~h3y  =  e';  y(0)=l  ^0)  =  0’  /'(o)=o 

into  an  initial-value  problem  for  the  variables  y,  dy / dt,  and  d2y / dt1. 
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Solution .  Set  xx(t)=y9  x2(t)  =  dy / dt9  and  x3(t)  =  d2y/dt2.  Then, 


dxx 

dt 


=  x 


2> 


dx2 

~di 


=  x 


3> 


dx2 

dt 


=  e—  X' 


3xx. 


Moreover,  the  functions  xl9  x2,  and  x3  satisfy  the  initial  conditions  X!(0)  = 
1 ,  x2(0)  =  0,  and  Jt3(0)  =  0. 


If  each  of  the  functions  fX9---Jn  in  (1)  is  a  linear  function  of  the  depen¬ 
dent  variables  xx,...,xn,  then  the  system  of  equations  is  said  to  be  linear. 
The  most  general  system  of  n  first-order  linear  equations  has  the  form 


dxx 

-jf  "*n(0*i  +  •••  +  *i„('K  +  Si(0 
dxn 

+  ...+ann(t)x„  +  gn(t). 


If  each  of  the  functions  gx,...,gn  is  identically  zero,  then  the  system  (2)  is 
said  to  be  homogeneous;  otherwise  it  is  nonhomogeneous.  In  this  chapter, 
we  only  consider  the  case  where  the  coefficients  atj  do  not  depend  on  t. 
Now,  even  the  homogeneous  linear  system  with  constant  coefficients 


dxx 

dt 


*11*1  *  "h  &\nXn 


dx„ 

dt 


=  anlx 


+ 


+  *«.*!. 


is  quite  cumbersome  to  handle.  This  is  especially  true  if  n  is  large.  There¬ 
fore,  we  seek  to  write  these  equations  in  as  concise  a  manner  as  possible. 
To  this  end  we  introduce  the  concepts  of  vectors  and  matrices. 


Definition.  A  vector 


is  a  shorthand  notation  for  the  sequence  of  numbers  xx,...,xn.  The 
numbers  xx,...9xn  are  called  the  components  of  x.  If  xx  =  xx(t)9...9  and 
xn~xn(t),  then 


is  called  a  vector- valued  function.  Its  derivative  dx(t)/dt  is  the  vector- 
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valued  function 

dxx(t ) 
dt 


dx„{t) 

dt 


Definition.  A  matrix 


a\\ 

a  12 

•  •  • 

a\n 

A  = 

a2\ 

• 

• 

a22 

• 

• 

•  •  • 

a2n 

• 

• 

• 

a  ml 

• 

<*m2 

•  •  • 

• 

a 

mn 

is  a  shorthand  notation  for  the  array  of  numbers  a0  arranged  in  m  rows 
and  n  columns.  The  element  lying  in  the  /th  row  and  jth  column  is  de¬ 
noted  by  aip  the  first  subscript  identifying  its  row  and  the  second  sub¬ 
script  identifying  its  column.  A  is  said  to  be  a  square  matrix  if  m  —  n . 

Next,  we  define  the  product  of  a  matrix  A  with  a  vector  x. 

Definition.  Let  A  be  an  n  X  n  matrix  with  elements  and  let  x  be  a  vector 
with  components  We  define  the  product  of  A  with  x,  denoted 

by  Ax,  as  the  vector  whose  /th  component  is 

aiXxx  +  ai2x2  +  ...  +  ainx„, 

In  other  words,  the  /th  component  of  Ax  is  the  sum  of  the  product  of 
corresponding  terms  of  the  /th  row  of  A  with  the  vector  x.  Thus, 


a\\ 

a\2 

•  •  • 

<*\ n 

Ax  = 

a2\ 

m 

• 

a22 

• 

• 

•  •  • 

a2  n 

• 

• 

• 

an\ 

• 

an2 

•  «  • 

• 

n 

a\\x\  +  aX2x2  +  -"+a\nXn 

a2\x\  +  a22X2  +  •  •  •  +  a2nXn 
an\X\  +  an2X2  +  •••  +«„„*„ 

For  example, 


'  l 

2 

4' 

'3' 

'  3+4+4' 

11 

-l 

0 

6 

2 

— 

-3+0+6 

— 

3 

l 

V 

1 

1. 

1 

k*  J 

3  +  2+1 

V.  J 

6J 
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Finally,  we  observe  that  the  left-hand  sides  of  (3)  are  the  components  of 
the  vector  dx/dt ,  while  the  right-hand  sides  of  (3)  are  the  components 
of  the  vector  Ax.  Hence,  we  can  write  (3)  in  the  concise  form 


dx  a  , 

x  =  — —  =  Ax,  where  x= 
dt 


'V 


n 


and  A  = 


axx 

a\2 

•  •  • 

<*1» 

a2\ 

a22 

•  •  • 

a2n 

• 

• 

• 

^n\ 

• 

• 

• 

0„2 

•  •  • 

• 

• 

• 

Moreover,  if  xx(t),...,xn(t)  satisfy  the  initial  conditions 
then  x(t)  satisfies  the  initial- value  problem 


x  =  Ax,  x(/0)  =  x°,  where  x°  = 


x° 


For  example,  the  system  of  equations 


dx] 

dt 

dx2 

~dt 

dx  3 

dt 


=  3xx  —  lx2  +  9x3 


=  1 5  jcj  +  x2  —  x3 


=  7xl  +  6x3 


can  be  written  in  the  concise  form 


3 

-7 

9] 

• 

X  = 

15 

1 

-1 

X, 

x  = 

x2 

7 

0 

6 

> 

.*3. 

and  the  initial-value  problem 

dxx 


dt 
dx2 
dt 
dx o 


=  xx-x2  +  x3, 


=  3xt  —  x 


3> 


=  xx  +  7x: 


dt  -l  ’  -3, 

can  be  written  in  the  concise  form 


x  i(0)  =  1 


x2(0)  =  0 


x3(0)=-l 


• 

x  = 

1 

0 

-l 

3- 

r 

-l 

X, 

II 

g 

r 

0 

l 

0 

7j 

l-ij 

(4) 


(5) 
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Now  that  we  have  succeeded  in  writing  (3)  in  the  more  manageable 
form  (4),  we  can  tackle  the  problem  of  finding  all  of  its  solutions.  Since 
these  equations  are  linear,  we  will  try  and  play  the  same  game  that  we 
played,  with  so  much  success,  with  the  second-order  linear  homogeneous 
equation.  To  wit,  we  will  show  that  a  constant  times  a  solution  and  the 
sum  of  two  solutions  are  again  solutions  of  (4).  Then,  we  will  try  and  show 
that  we  can  find  every  solution  of  (4)  by  taking  all  linear  combinations  of  a 
finite  number  of  solutions.  Of  course,  we  must  first  define  what  we  mean 
by  a  constant  times  x  and  the  sum  of  x  and  y  if  x  and  y  are  vectors  with  n 
components. 


Definition.  Let  c  be  a  number  and  x  a  vector  with  n  components  x„ . . .  ,xn. 
We  define  cx  to  be  the  vector  whose  components  are  cxv...,cxn,  that  is 


cx  =  c 


r 

V. _ 

x2 

• 

II 

cx2 

• 

_ 

• 

For  example,  if 


c  =  2  and  x  = 

'3' 

1 

,  then  2x  =  2 

f3l 

1 

— 

<51 

2 

7 

k.  J 

7 

V.  J 

14  J 

This  process  of  multiplying  a  vector  x  by  a  number  c  is  called  scalar 
multiplication. 


Definition.  Let  x  and  y  be  vectors  with  components  xl9...,xn  and y l9 ...,yn 
respectively.  We  define  x  +  y  to  be  the  vector  whose  components  are 
Xj+.y that  is 


For  example,  if 


then 


x\  ' 

<  > 

y  i 

xi+yi' 

x2 

y  2 

x2+y2 

x+y  = 

• 

• 

• 

^  J 

+ 

• 

• 

• 

yn 

K.  J 

• 

• 

• 

.  *n  +yn . 

rr 

r-r 

X  = 

6 

3 

[2. 

and  y  = 

-6 

7 

9J 

This  process  of  adding  two  vectors  together  is  called  vector  addition. 
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Having  defined  the  processes  of  scalar  multiplication  and  vector  addi¬ 
tion,  we  can  now  state  the  following  theorem. 

Theorem  1.  Let  x(7)  and  y(t)  be  two  solutions  of  ( 4).  Then  ( a )  cx(/)  is  a  solu¬ 
tion,  for  any  constant  c,  and  ( b )  x(/)4-  y(t)  is  again  a  solution. 

Theorem  1  can  be  proven  quite  easily  with  the  aid  of  the  following 
lemma. 

Lemma.  Let  A  be  an  nXn  matrix.  For  any  vectors  x  and  y  and  constant  c, 
(i a )  A(cx)  =  cAx  and  ( b )  A(x  +  y)  =  Ax  -I-  Ay . 

Proof  of  lemma. 

(a)  We  prove  that  two  vectors  are  equal  by  showing  that  they  have  the 
same  components.  To  this  end,  observe  that  the  zth  component  of  the  vec¬ 
tor  cAx  is 


caixxx  4-  cai2x2  +  . . .  +  cainxn  =  c{aixxx  +  . . .  4* ainxn), 
and  the  /th  component  of  the  vector  A(cx)  is 

an  {cxx)  +  aa(cx2)  +  . . .  +  ain(cxn)  =  c(anxx  +  . . .  +  ainxn). 

Hence  A(cx)  =  cAx. 

(b)  The  /th  component  of  the  vector  A(x  +  y)  is 

«/i  (*i  +>’i)  +  •  •  •  +  am(x„  +yn )  =  (anx\  +■■■+  ainx„ )  +  {anyx+...+  ainyn). 

But  this  is  also  the  /th  component  of  the  vector  Ax  +  Ay  since  the  /th  com¬ 
ponent  of  Ax  is  aixxx  +  ...  +ainxn  and  the  zth  component  of  Ay  is  aiXyx 
+  . . .  +  ainyn.  Hence  A(x  +  y)  =  Ax 4-  Ay.  □ 

Proof  of  theorem  1 . 

(a) .  If  x(/)  is  a  solution  of  (4),  then 

^cx(t)  =  c^^  =  cAx(t)  =  A(cx(t)). 

Hence,  cx(/)  is  also  a  solution  of  (4). 

(b) .  If  x(/)  and  y(7)  are  solutions  of  (4)  then 

d  ,  >.  dx(t)  dy(t) 

jj-(x(')+y(0)  =  ~^r  +  ~~dT  =Ax(0+Ay(0=A(x(/)+y(0)- 

Hence,  x(/)  +  y(0  is  also  a  solution  of  (4).  □ 

An  immediate  corollary  of  Theorem  1  is  that  any  linear  combination  of 
solutions  of  (4)  is  again  a  solution  of  (4).  That  is  to  say,  if  x1(/),...,x/(/)  are 
j  solutions  of  (4),  then  c1x1(/)4- ...  +Cjx/(i)  is  again  a  solution  for  any 
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choice  of  constants  cx,c2, For  example,  consider  the  system  of  equa¬ 
tions 


dxx 

nr=x» 


dx2 

dt 


=  —4xx,  or 


dx 

dt 


This  system  of  equations  was  derived  from  the  second-order  scalar  equa¬ 
tion  (d2y /dt2)  +  4y  =  0  by  setting  xx=y  and  x2  =  dy / dt.  Since >>,(/)  =  cos 2/ 
dLn&y2(t)  =  sm2t  are  two  solutions  of  the  scalar  equation,  we  know  that 


*!(*)!_  /  cos2/\  /  sin2f 

x2  (t)J  y— 2sin2  tf  2y2cos2f 


cxcos2t  +  c2  sin2 1 
2c1sin2/  +  2c2  cos2 1 


is  a  solution  of  (6)  for  any  choice  of  constants  cx  and  c2. 

The  next  step  in  our  game  plan  is  to  show  that  every  solution  of  (4)  can 
be  expressed  as  a  linear  combination  of  finitely  many  solutions.  Equiv¬ 
alently,  we  seek  to  determine  how  many  solutions  we  must  find  before  we 
can  generate  all  the  solutions  of  (4).  There  is  a  branch  of  mathematics 
known  as  linear  algebra,  which  addresses  itself  to  exactly  this  question,  and 
it  is  to  this  area  that  we  now  turn  our  attention. 


Exercises 


In  each  of  Exercises  1-3  convert  the  given  differential  equation  for  the  sin¬ 
gle  variable  y  into  a  system  of  first-order  equations. 


*  -  (!)!-° 


1.  -^-  + 


2. 


d3y 


— 


+  cosy  =  e 


dt 3  V  dt  )  dt 3 

4.  Convert  the  pair  of  second-order  equations 


3. 

dt 4  dt 2 


+3—  +2y  =  0, 
dt2  dt  * 


d2z  ,-,dy 
—  +3-j-  +2z=0 
dt2  dt 


into  a  system  of  4  first-order  equations  for  the  variables 

xx  =y,  x2  =y',  x3  =  z,  and  x4  =  z'. 


5.  (a)  Lety(/)  be  a  solution  of  the  equation  y"+y'+y  =  0.  Show  that 


^(0 


is  a  solution  of  the  system  of  equations 

H-?  -!)x- 
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(b)  Let 


*i(0 

x2(t) 


be  a  solution  of  the  system  of  equations 


Show  that>>  =  *!(/)  is  a  solution  of  the  equation  y ”  +>>'+>'  =  0. 


In  each  of  Exercises  6-9,  write  the  given  system  of  differential  equations 
and  initial  values  in  the  form  x  =  A x,x(70)  =  x°. 


6.  x1=3x1  —  7x2,  Xi(0)=\ 
x2  =  4x„  *2(°)  =1 


7.  xi  =  5x1  +  5x2,  Xj(3)  =  0 

x2=  — X!+7x2,  x2(3)  =  6 


8.  xx  =  xx  +  x2  —  x3,  Xj(0)  =  0 

X2  =  3*i-x2  +  4x3,  x2(0)  =  1 
x3= -Xi-X2,  x3(0)=-l 


9.  x,  =  —  x3,  Xj(  1)  =  2 
x2=  xj,  x2(— 1)  =  3 
^3=“^  x3(-\)  =  4 


10.  Let 


Compute  x  +  y  and  3x  —  2y . 


and  y  = 


11.  Let 


A  = 


Compute  Ax  if 


1  2  —  1  \ 

3  0  4  . 

-1  -1  2/ 


12.  Let  A  be  any  nXn  matrix  and  let  eJ  be  the  vector  whose  j th  component  is  1 
and  whose  remaining  components  are  zero.  Verify  that  the  vector  AeJ  is  the  yth 
column  of  A. 


13.  Let 


Compute  Ax  if 


o 

1 

0 


14.  Let  A  be  a  3  X  3  matrix  with  the  property  that 


Ml  1 
and  Al  —  1 

V  -1 


1\ 

2 

3 
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Compute 


Hint :  Write 


as  a  linear  combination  of 


15.  Let  A  be  a  2  X  2  matrix  with  the  property  that 

a(|)-(4)  and  a(_])-(:|). 
Find  A.  Hint :  The  easy  way  is  to  use  Exercise  12. 


3.2  Vector  spaces 

In  the  previous  section  we  defined,  in  a  natural  manner,  a  process  of 
adding  two  vectors  x  and  y  together  to  form  a  new  vector  z  =  x  +  y,  and  a 
process  of  multiplying  a  vector  x  by  a  scalar  c  to  form  a  new  vector  u  =  cx. 
The  former  process  was  called  vector  addition  and  the  latter  process  was 
called  scalar  multiplication.  Our  study  of  linear  algebra  begins  with  the 
more  general  premise  that  we  have  a  set  V  of  elements  x,y,z,...  and  that 
we  have  one  process  that  combines  two  elements  x  and  y  of  V  to  form  a 
third  element  z  in  V  and  a  second  process  that  combines  a  number  c  and 
an  element  x  in  V  to  form  a  new  element  u  in  V.  We  will  denote  the  first 
process  by  addition;  that  is,  we  will  write  z  =  x  +  y,  and  the  second  process 
by  scalar  multiplication;  that  is,  we  will  write  u  =  cx,  if  they  satisfy  the  us¬ 
ual  axioms  of  addition  and  multiplication.  These  axioms  are: 

(i)  x  +  y=y +x  (commutative  law) 

(ii)  x  +  (y  +  z)  =  (x  +  y)  +  z  (associative  law) 

(iii)  There  is  a  unique  element  in  V,  called  the  zero  element,  and  denoted 
by  0,  having  the  property  that  x  +  0  =  x  for  all  x  in  V. 

(iv)  For  each  element  x  in  V  there  is  a  unique  element,  denoted  by  —  x 
and  called  minus  x,  such  that  x  +  (— x)  =  0. 

(v)  1  *x  =  x  for  all  x  in  V. 

(vi)  (ab)x  =  a(bx)  for  any  numbers  a,b  and  any  element  x  in  V. 

(vii)  a(x  +  y)  =  tfx  +  tfy 

(viii)  (a  + b)x  =  ax+ bx. 

A  set  V,  together  with  processes  addition  and  multiplication  satisfying 
(i)— (viii)  is  said  to  be  a  vector  space  and  its  elements  are  called  vectors.  The 
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numbers  a,  b  will  usually  be  real  numbers,  except  in  certain  special  cases 
where  they  will  be  complex  numbers. 

Remark  1.  Implicit  in  axioms  (i)-(viii)  is  the  fact  that  if  x  and  y  are  in  V, 
then  the  linear  combination  ax  +  by  is  again  in  V  for  any  choice  of  con¬ 
stants  a  and  b. 


Remark  2.  In  the  previous  section  we  defined  a  vector  x  as  a  sequence  of  n 
numbers.  In  the  more  general  context  of  this  section,  a  quantity  x  is  a  vec¬ 
tor  by  dint  of  its  being  in  a  vector  space.  That  is  to  say,  a  quantity  x  is  a 
vector  if  it  belongs  to  a  set  of  elements  V  which  is  equipped  with  two 
processes  (addition  and  scalar  multiplication)  which  satisfy  (i)— (viii).  As  we 
shall  see  in  Example  3  below,  the  set  of  all  sequences 


x  = 


of  n  real  numbers  is  a  vector  space  (with  the  usual  operations  of  vector 
addition  and  scalar  multiplication  defined  in  Section  3.1).  Thus,  our  two 
definitions  are  consistent. 


Example  1.  Let  V  be  the  set  of  all  functions  x(t)  which  satisfy  the  differen¬ 
tial  equation 


with  the  sum  of  two  functions  and  the  product  of  a  function  by  a  number 
being  defined  in  the  usual  manner.  That  is  to  say, 


(/.+/2)(0=/l(0 +/*(*) 

and 


It  is  trivial  to  verify  that  V  is  a  vector  space.  Observe  first  that  if  xl  and  x2 
are  in  V,  then  every  linear  combination  cxxx  +  c2x 2  is  in  V,  since  the  dif¬ 
ferential  equation  (1)  is  linear.  Moreover,  axioms  (i),  (ii),  and  (v)-(viii)  are 
automatically  satisfied  since  all  we  are  doing  at  any  time  t  in  function 
addition  and  multiplication  of  a  function  by  a  number  is  adding  or  multi¬ 
plying  two  numbers  together.  The  zero  vector  in  V  is  the  function  whose 
value  at  any  time  t  is  zero;  this  function  is  in  V  since  x(f)=0  is  a  solution 
of  (1).  Finally,  the  negative  of  any  function  in  V  is  again  in  V,  since  the 
negative  of  any  solution  of  (1)  is  again  a  solution  of  (1). 
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Example  2.  Let  V  be  the  set  of  all  solutions  x{t)  of  the  differential  equa¬ 
tion  (d2x/dt2)  —  6jc2  =  0,  with  the  sum  of  two  functions  and  the  product  of 
a  function  by  a  number  being  defined  in  the  usual  manner.  V  is  not  a  vec¬ 
tor  space  since  the  sum  of  any  two  solutions,  while  being  defined,  is  not 
necessarily  in  V.  Similarly,  the  product  of  a  solution  by  a  constant  is  not 
necessarily  in  V.  For  example,  the  function  x(t)=\/t 2  is  in  V  since  it 
satisfies  the  differential  equation,  but  the  function  2x(7)  =  2/ 1 2  is  not  in  V 
since  it  does  not  satisfy  the  differential  equation. 

Example  3.  Let  V  be  the  set  of  all  sequences 

f*il 


x  = 


of  n  real  numbers.  Define  x  +  y  and  cx  as  the  vector  addition  and  scalar 
multiplication  defined  in  Section  3.1.  It  is  trivial  to  verify  that  V  is  a  vector 
space  under  these  operations.  The  zero  vector  is  the  sequence 

'  0  ' 

0 

0 

V  J 


and  the  vector  —  x  is  the  vector 


This  space  is  usually  called  n  dimensional  Euclidean  space  and  is  denoted 
by  FT. 

Example  4.  Let  V  be  the  set  of  all  sequences 


of  n  complex  numbers  x{,...,xn.  Define  x  +  y  and  cx,  for  any  complex 
number  c,  as  the  vector  addition  and  scalar  multiplication  defined  in  Sec¬ 
tion  3.1.  Again,  it  is  trivial  to  verify  that  V  is  a  vector  space  under  these 
operations.  This  space  is  usually  called  complex  n  dimensional  space  and  is 
denoted  by  C". 
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Example  5.  Let  V  be  the  set  of  all  n  X  n  matrices  A.  Define  the  sum  of  two 
matrices  A  and  B  to  be  the  matrix  obtained  by  adding  together  corre¬ 
sponding  elements  of  A  and  B,  and  define  the  matrix  cA  to  be  the  matrix 
obtained  by  multiplying  every  element  of  A  by  c.  In  other  words, 


a\\  an 

...  dXfJ 

b\\  bl2 

...  bXn 

a2\  a22 

•  • 

•  • 

•  • 

•  •  •  a2n 

• 

• 

• 

+ 

b2\  b22  ...  b2n 

•  •  • 

•  •  • 

•  •  • 

a„i  a„  2 

. . .  dnn 

bni  bn  2 

. . .  bnn 

> 

aw  +  bu  aX2  +  bn 
d2\  bj\  a22+  ^22 


“ \n  +  b\n 
<*2n  +  K 


and 


l 

an\  +  bn 

1  an2 

+  bn2 

•  •  • 

^ nn  b nn  J 

aw 

a\2 

•  •  • 

a\n 

cau 

ca\2 

•  •  • 

C“\n 

a2 1 

• 

a22 

• 

•  •  • 

a2n 

• 

ca2 1 

• 

Ca22 

• 

•  •  • 

C“2„ 

• 

• 

• 

• 

anl 

• 

• 

an2 

•  •  • 

• 

• 

®nn 

r - 

" 

• 

• 

can2 

•  •  • 

- \ 

s: 

.  .  Q* 

Axioms  (i),  (ii),  and  (v)-(viii)  are  automatically  satisfied  since  all  we  are 
doing  in  adding  two  matrices  together  or  multiplying  a  matrix  by  a  number 
is  adding  or  multiplying  two  numbers  together.  The  zero  vector,  or  the 
matrix  0,  is  the  matrix  whose  every  element  is  the  number  zero,  and  the 
negative  of  any  matrix  A  is  the  matrix 

~  d\\  •••  ~  n 

•  • 

•  •  • 

•  • 

~  Cl n\  •  •  •  —  Ctnn 

Hence  V  is  a  vector  space  under  these  operations  of  matrix  addition  and 
scalar  multiplication. 

Example  6.  We  now  present  an  example  of  a  set  of  elements  which  comes 
close  to  being  a  vector  space,  but  which  doesn’t  quite  make  it.  The  purpose 
of  this  example  is  to  show  that  the  elements  of  V  can  be  just  about  any¬ 
thing,  and  the  operation  of  addition  can  be  a  rather  strange  process.  Let  V 
be  the  set  consisting  of  three  animals,  a  cat,  a  dog,  and  a  mouse.  Whenever 
any  two  of  these  animals  meet,  one  eats  up  the  other  and  changes  into  a 
different  animal.  The  rules  of  eating  are  as  follows. 
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(1)  If  a  dog  meets  a  cat,  then  the  dog  eats  up  the  cat  and  changes  into  a 
mouse. 

(2)  If  a  dog  meets  another  dog,  then  one  dog  eats  up  the  other  and 
changes  into  a  cat. 

(3)  If  a  dog  meets  a  mouse,  then  the  dog  eats  up  the  mouse  and  remains 
unchanged. 

(4)  If  a  cat  meets  another  cat,  then  one  cat  eats  up  the  other  and  changes 
into  a  dog. 

(5)  If  a  cat  meets  a  mouse,  then  the  cat  eats  up  the  mouse  and  remains  un¬ 
changed. 

(6)  If  a  mouse  meets  another  mouse,  then  one  mouse  eats  up  the  other  and 
remains  unchanged. 

Clearly,  “eating”  is  a  process  which  combines  two  elements  of  V  to  form  a 
third  element  in  V.  If  we  call  this  eating  process  addition,  and  denote  it  by 
+  ,  then  rules  1-6  can  be  written  concisely  in  the  form 

1.  D  +  C  =  M  2.D  +  D  =  C  3.  D  +  M=D 

4.  C+C  =  D  5.  C+M=C  6.  M+  M=M. 

This  operation  of  eating  satisfies  all  the  axioms  of  addition.  To  see  this, 
note  that  axiom  (i)  is  satisfied  since  the  eating  formulae  do  not  depend  on 
the  order  of  the  two  animals  involved.  This  is  to  say,  D  +  C  =  C  +  D,  etc. 
Moreover,  the  result  of  any  addition  is  again  an  animal  in  V.  This  would 
not  be  the  case,  for  example,  if  a  dog  ate  up  a  cat  and  changed  into  a 
hippopotamus.  The  associative  law  (ii)  is  also  satisfied,  but  it  has  to  be 
verified  explicitly.  For  example,  suppose  that  we  have  an  encounter  be¬ 
tween  two  cats  and  a  dog.  It  is  not  obvious,  a  priori,  that  it  does  not  make 
a  difference  whether  the  two  cats  meet  first  and  their  resultant  meets  the 
dog  or  whether  one  cat  meets  the  dog  and  their  resultant  meets  the  other 
cat.  To  check  that  this  is  so  we  compute 

(C+C)  +  Z)  =  Z)  +  Z)  =  C 

and 

(C  +  Z))  +  C=M+C=C. 

In  a  similar  manner  we  can  show  that  the  result  of  any  encounter  between 
three  animals  is  independent  of  the  order  in  which  they  meet.  Next,  ob¬ 
serve  that  the  zero  element  in  V  is  the  mouse,  since  every  animal  is  un¬ 
changed  after  eating  a  mouse.  Finally,  “minus  a  dog”  is  a  cat  (since  D+C 
=  M ),  “minus  a  cat”  is  a  dog  and  “minus  a  mouse”  is  a  mouse.  However, 
V  is  not  a  vector  space  since  there  is  no  operation  of  scalar  multiplication 
defined.  Moreover,  it  is  clearly  impossible  to  define  the  quantities  aC  and 
aD ,  for  all  real  numbers  a ,  so  as  to  satisfy  axioms  (v)-(viii). 
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Example  7.  Let  V  be  the  set  of  all  vector-valued  solutions 

x,(0 

x(/)=  : 

x„(t) 

V 

of  the  vector  differential  equation 

r  — 

Cl  1 1  •  •  • 

x  =  Ax,  A=  : 

dni 

V  is  a  vector  space  under  the  usual  operations  of  vector  addition  and 
scalar  multiplication.  To  wit,  observe  that  axioms  (i),  (ii),  and  (v)-(viii)  are 
automatically  satisfied.  Hence,  we  need  only  verify  that 

(a)  The  sum  of  any  two  solutions  of  (2)  is  again  a  solution. 

(b)  A  constant  times  a  solution  of  (2)  is  again  a  solution. 

(c)  The  vector-valued  function 

*i(0  [  o 

*w=  ;  =  ; 

x„{t)  { o 

v  J 

is  a  solution  of  (2)  (axiom  (iii)). 

(d)  The  negative  of  any  solution  of  (2)  is  again  a  solution  (axiom  (iv)). 

Now  (a)  and  (b)  are  exactly  Theorem  1  of  the  previous  section,  while  (d)  is 
a  special  case  of  (b).  To  verify  (c)  we  observe  that 

r  o 

d_  . 
dt 

l  0 

Hence  the  vector-valued  function  x(/)=0  is  always  a  solution  of  the  dif¬ 
ferential  equation  (2). 

Exercises 

In  each  of  Problems  1-6,  determine  whether  the  given  set  of  elements 

X  =  X2 
*3 

V.  J 

form  a  vector  space  under  the  properties  of  vector  addition  and  scalar  mul¬ 
tiplication  defined  in  Section  3.1. 
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1.  The  set  of  all  elements  x=  (  x2  )  where  3xl  —  2x2  =  0 

x3 

X\ 

2.  The  set  of  all  elements  x  =  |  x2  |  where  xx  +  x2  +  x3  =  0 

x3 

X\\ 

3.  The  set  of  all  elements  x  =  |  x2  |  where  x  j  +  x2  +  x3  =  1 

*3 

4.  The  set  of  all  elements  x  =  |  x2  |  where  xx  +  x2  +  x3  =  1 

*3 

5.  The  set  of  elements  x=  |  a  |  for  all  real  numbers  a  and  b 

b 


6.  The  set  of  all  elements  x  =  |  x2  |  where 

*3 

x\  +  x2  +  *3  =  0,  x{  —  x2  +  2x3  =  0,  3xx-x2  +  5x3  =  0 


In  each  of  Problems  7-11  determine  whether  the  given  set  of  functions 
form  a  vector  space  under  the  usual  operations  of  function  addition  and 
multiplication  of  a  function  by  a  constant. 

7.  The  set  of  all  polynomials  of  degree  <  4 

8.  The  set  of  all  differentiable  functions 

9.  The  set  of  all  differentiable  functions  whose  derivative  at  t  =  1  is  three 


10.  The  set  of  all  solutions  of  the  differential  equation  y"  +y  =  cos t 

11.  The  set  of  all  functions  y{t)  which  have  period  2vr,  that  isy(/  +  277-)=y(/) 


12.  Show  that  the  set  of  all  vector-valued  solutions 

*i(0 


x(/)  = 


x2(t) 


of  the  system  of  differential  equations 


dxx 

dt 


=  *2+  1> 


dx2 

dt 


—  xx  + 1 


is  not  a  vector  space. 


3.3  Dimension  of  a  vector  space 

Let  V  be  the  set  of  all  solutions  y(t)  of  the  second-order  linear  homoge¬ 
neous  equation  (d2y / dt2)+  p(t)(dy / dt)  +  q(t)y  =0.  Recall  that  every  solu¬ 
tion  y(t)  can  be  expressed  as  a  linear  combination  of  any  two  linearly  inde¬ 
pendent  solutions.  Thus,  if  we  knew  two  “independent”  functions  y\t)  and 
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y2(t)  in  V,  then  we  could  find  every  function  in  V  by  taking  all  linear  com¬ 
binations  cxy\t)  +  c2y2(t)  of yl  and>>2.  We  would  like  to  derive  a  similar 
property  for  solutions  of  the  equation  x  =  Ax.  To  this  end,  we  define  the 
notion  of  a  finite  set  of  vectors  generating  the  whole  space,  and  the  notion 
of  independence  of  vectors  in  an  arbitrary  vector  space  V. 

Definition.  A  set  of  vectors  x1,x2,...,x/I  is  said  to  span  V  if  the  set  of  all 
linear  combinations  CjX1  -I- c2x2+  •••  +cnxft  exhausts  V.  That  is  to  say, 
the  vectors  x*,x2, . . . ,xw  span  V  if  every  element  of  V  can  be  expressed  as 
a  linear  combination  of  x1,x2,...,x'1. 

Example  1.  Let  V  be  the  set  of  all  solutions  of  the  differential  equation 
{d2x  /  dt2)  —  x  =  0.  Let  xl  be  the  function  whose  value  at  any  time  Ms  e* 
and  let  x2  be  the  function  whose  value  at  any  time  t  is  e~‘.  The  functions 
xl  and  x2  are  in  V  since  they  satisfy  the  differential  equation.  Moreover, 
these  functions  also  span  V  since  every  solution  x(t)  of  the  differential 
equation  can  be  written  in  the  form 

x(t)  =  cle'  +  c2e-' 

so  that 

x  =  c1xl  +  c2x2. 


Example  2.  Let  V  =  R"  and  let  eJ  denote  the  vector  with  a  1  in  the yth  place 
and  zeros  everywhere  else,  that  is, 


II 

la 

1 1 
0 

0 

,  e2  = 

0 

l 

0 

• 

e"  = 

, . . . ,  ^ 

- \ 

.  .  .  O  © 

J 

t - 

V _ 

• 

• 

0 

k  J 

0 

l 

k  J 

The  set  of  vectors  e\e2,...,e"  span  R"  since  any  vector 

*2 


X  = 


can  be  written  in  the  form 


'  xx' 

r  o  i 

'  0  ' 

0 

• 

+ 

*2 

• 

+  . . .  + 

0 

• 

t 

• 

0  J 

• 

• 

0 

^  J 

• 

*«. 

k.  J 

=  x{el  +  x2e2  4- . . .  +  xnen. 
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Definition.  A  set  of  vectors  x*,x2, . . . ,x"  in  V  is  said  to  be  linearly  dependent 
if  one  of  these  vectors  is  a  linear  combination  of  the  others.  A  very  pre¬ 
cise  mathematical  way  of  saying  this  is  as  follows.  A  set  of  vectors 
x\x2, ...,xn  is  said  to  be  linearly  dependent  if  there  exist  constants 
Cj,c2, .  ..,c„,  not  all  zero  such  that 

e1x1  +  c2x2+...+c„x',=0. 

These  two  definitions  are  equivalent,  for  if  xj  is  a  linear  combination  of 
x1,...,x/_1,x/+1,...,x'1,  that  is 

XJ  =  C,x‘  +  . . .  +  Cj_  1  +  Cj+  ,X"/  +  >+...+  c„x", 

then  the  linear  combination 

c,x‘  +  . . .  +  Cj_  ,x>-  '-xJ  +  cJ+lxJ+ 1  +  . . .  +  Cnxn 

equals  zero  and  not  all  the  constants  are  zero.  Conversely,  if  c,x]  +  c2x2 
+  . . .  +  cnxn  =  0  and  Cj ;¥=  0  for  some  y,  then  we  can  divide  by  Cj  and  solve  for 
xJ  as  a  linear  combination  of  x1, . . . , xJ ~  \ xJ  +  *, . . . , xn.  For  example,  if  cx  =^0 
then  we  can  divide  by  c,  to  obtain  that 


Definition.  If  the  vectors  x\x2, ...,x”  are  not  linearly  dependent,  that  is, 
none  of  these  vectors  can  be  expressed  as  a  linear  combination  of  the 
others,  then  they  are  said  to  be  linearly  independent.  The  precise 
mathematical  way  of  saying  this  is  that  the  vectors  x1,x2,...,x'1  are  lin¬ 
early  independent  if  the  equation 

c,x'  +  c2x2+...+crtx"=0 

implies,  of  necessity,  that  all  the  constants  cl9c2,...,cn  are  zero. 

In  order  to  determine  whether  a  set  of  vectors  x\x2, ...,xn  is  linearly  de¬ 
pendent  or  linearly  independent,  we  write  down  the  equation  cxxl  +  c2x2 
+ ...  +c„xw  =  0  and  see  what  this  implies  about  the  constants  c,,c2,...,c„. 
If  all  these  constants  must  be  zero,  then  x*,x2,. . .,xw  are  linearly  indepen¬ 
dent.  On  the  other  hand,  if  not  all  the  constants  cl9c29..*,cn  must  be  zero, 
then  x^x2,...^”  are  linearly  dependent. 

Example  3.  Let  V  =  R3  and  let  x1,  x2,  and  x3  be  the  vectors 

f  n  rn  r 3 

x*=  -1  ,  x2=  2  and  x3=  0  • 

1  3  5 
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To  determine  whether  these  vectors  are  linearly  dependent  or  linearly  inde¬ 
pendent,  we  write  down  the  equation  c1x1  +  c2x2  +  c3x3  =  0,  that  is 


r 

T 

'3' 

01 

Cl 

-l 

.  i. 

+  c2 

2 

L  3  J 

+  c3 

0 

1 5  J 

0 

LoJ 

The  left-hand  side  of  this  equation  is  the  vector 


Cj  +  c2  +  3c3 
—  Cj  +  2c2 

c  |  4-  3c2  4-  5c3 

<  - 

Hence  the  constants  cl9  c2 ,  and  c3  must  satisfy  the  equations 

ci  +  c2  +  3c3  =  0,  (i) 

—  c1  +  2c2  =  0,  (ii) 

c\  +  3c2  +  5c3  =  0.  (iii) 

Equation  (ii)  says  that  cx  =  2c2.  Substituting  this  into  Equations  (i)  and  (iii) 
gives 

3c24-3c3  =  0  and  5c2  +  5c3  =  0. 

These  equations  have  infinitely  many  solutions  c2,c3  since  they  both  reduce 
to  the  single  equation  c2  +  c3  =  0.  One  solution,  in  particular,  is  c2=  —  1, 
c3=  1.  Then,  from  Equation  (ii),  ct=  -2.  Hence, 
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[3] 

ro' 

-1 

— 

2 

+ 

0 

= 

0 

1 

V  J 

L  3  J 

5 

L  J 

LoJ 

and  x1,  x2  and  x3  are  linearly  dependent  vectors  in  R3. 


Example  4.  Let  V  =  R"  and  let  e1,e2,...,e"  be  the  vectors 


II 
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To  determine  whether  e\e2,...,en  are  linearly  dependent  or  linearly  inde¬ 
pendent,  we  write  down  the  equation  c^e1 4- . . .  4-  cnen  =  0,  that  is 
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+  . . .  +  cn 

• 

• 

• 

0 

— 

0 

• 

• 

• 

0 

J 

0 

k.  J 

1 

L  J 

0 

L  J 

282 


3.3  Dimension  of  a  vector  space 


The  left-hand  side  of  this  equation  is  the  vector 


Hence  cx  =0,c2  =  0,...,  and  c„  =  0.  Consequently,  e^e2, ...,en  are  linearly 
independent  vectors  in  Rw. 

Definition.  The  dimension  of  a  vector  space  V,  denoted  by  dimV,  is  the 
fewest  number  of  linearly  independent  vectors  which  span  V.  V  is  said 
to  be  a  finite  dimensional  space  if  its  dimension  is  finite.  On  the  other 
hand,  V  is  said  to  be  an  infinite  dimensional  space  if  no  set  of  finitely 
many  elements  span  V. 

The  dimension  of  a  space  V  can  be  characterized  as  the  fewest  number 
of  elements  that  we  have  to  find  in  order  to  know  all  the  elements  of  V.  In 
this  sense,  the  definition  of  dimension  captures  our  intuitive  feeling.  How¬ 
ever,  it  is  extremely  difficult  to  compute  the  dimension  of  a  space  V  from 
this  definition  alone.  For  example,  let  V  =  R".  We  have  shown  in  Examples 
2  and  4  that  the  vectors  e1,e2,...,e'1  are  linearly  independent  and  span  V. 
Moreover,  it  seems  intuitively  obvious  to  us  that  we  cannot  generate  Rn 
from  fewer  than  n  vectors.  Thus,  the  dimension  of  R"  should  be  n.  But  how 
can  we  prove  this  rigorously?  To  wit,  how  can  we  prove  that  it  is  impossi¬ 
ble  to  find  a  set  of  ( n  —  1)  linearly  independent  vectors  that  span  R"?  Thus, 
our  definition  of  dimension  is  not,  as  yet,  a  very  useful  one.  However,  it 
will  become  extremely  useful  after  we  prove  the  following  theorem. 


Theorem  2.  If  n  linearly  independent  vectors  span  V,  then  dim  V  =  n. 

We  will  need  two  lemmas  to  prove  Theorem  2.  The  first  lemma  con¬ 
cerns  itself  with  the  solutions  of  simultaneous  linear  equations  and  can  be 
motivated  as  follows.  Suppose  that  we  are  interested  in  determining  n  un¬ 
known  numbers  xl,x2,...,xn  uniquely.  It  seems  pretty  reasonable  that  we 
should  be  given  n  equations  satisfied  by  these  unknowns.  If  we  are  given 
too  few  equations  then  there  may  be  many  different  solutions,  that  is, 
many  different  sets  of  values  for  x{,x2,...,xn  which  satisfy  the  given  equa¬ 
tions.  Lemma  1  proves  this  in  the  special  case  that  we  have  m  homoge¬ 
neous  linear  equations  for  n  >  m  unknowns. 

Lemma  1.  A  set  of  m  homogeneous  linear  equations  for  n  unknowns 

X\9 x2 , . . . , Xn  always  admits  a  nontrivial  solution  if  m<n.  That  is  to  say , 
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the  set  of  m  equations  in  n  unknowns 

axxxx  +  aX2x2  +  ...  +  aXnxn= 0 

“I"  a22X2  +  •••“*”  a2nXn~® 

(l) 

<>m\X\  ^ m2X2  "^  *  *  *  "^"  ^ mnXn  ^ 

always  has  a  solution  xx, x2, . . . , xn,  other  than  xx  =  . . .  =  xn  =  0,  if  m<n. 

Remark.  Notice  that  xx  =  0,  x2  =  0, . . . ,  xn  =  0  is  certainly  one  solution  of  the 
system  of  equations  (1).  Thus,  Lemma  1  is  telling  us  that  these  equations 
have  more  than  one  solution. 

Proof  of  lemma  1.  We  will  prove  Lemma  1  by  induction  on  m.  To  this 
end,  observe  that  the  lemma  is  certainly  true  if  m  =  1,  for  in  this  case  we 
have  a  single  equation  of  the  form  axxxx  +  al2x2  +  ...  +  aXnxn  =  0,  with 
n>  2.  We  can  find  a  nontrivial  solution  of  this  equation,  if  axx  =  0,  by 
taking  xx  —  \9x2  =  0, ...,xw=0.  We  can  find  a  nontrivial  solution  of  this 
equation,  if  axx^=  0,  by  taking  x2  =  1, . . . , xn  =  1  and  xx  =  —  ( aX2  +  . . .  4- 

For  the  next  step  in  our  induction  proof,  we  assume  that  Lemma  1  is 
true  for  some  integer  m  —  k  and  show  that  this  implies  that  Lemma  1  is 
true  for  m  =  k  + 1,  and  k  + 1  <  n.  To  this  end,  consider  the  k  + 1  equations 
for  the  n  unknowns  xx,x2y.9xn 


auxx  +  aX2x2  +...+  aXnxn  =0 

^21*1  +  *22*2  +•••+  a2nxn  =0 


^*+1,1*1  +  ak+\,2X2  +  +ak+\,nXn 


with  k+l<n.  If  axx,a2l9...,ak+x  x  are  all  zero,  then  xx  =  1,x2  =  0,...,a:ai  =  0 
is  clearly  a  non-trivial  solution.  Hence,  we  may  assume  that  at  least  one  of 
these  coefficients  is  not  zero.  Without  any  loss  of  generality,  we  may 
assume  that  axx¥z 0,  for  otherwise  we  can  take  the  equation  with  the  non¬ 
zero  coefficient  of  xx  and  relabel  it  as  the  first  equation.  Then 
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*i  =  - 


12 


a 


a 


x2 


13 
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... 


In 


a 


x„. 


11 


Substituting  this  value  of  x,  in  the  second  through  the  ( k  +  l)st  equations, 


284 


3.3  Dimension  of  a  vector  space 


we  obtain  the  equivalent  equations 


a  n-*]+  ^12-^2 

+ 

a, 3*3 

+ 

.  .  .  + 

=  0 

bi2x2 

+ 

Z>23*3 

+ 

.  .  .  + 

b2nX» 

=  0 

£*2*2  +  £*3*3  +•••+  £*„*„  =0 

£*  +  1,2*2  +  £*+  1,3*3+  •••  +£*+l,n*n  =  0 


where  by  —  a^  —  a^ay/a^.  Now,  the  last  k  equations  of  (3)  are  k  homoge¬ 
neous  linear  equations  for  the  (n  —  1)  unknowns  x2,...,xn.  Moreover,  k  is 
less  than  n—  1  since  k+  1  is  less  than  n.  Hence,  by  the  induction  hypothe¬ 
sis,  these  equations  have  a  nontrivial  solution  x29...,xn.  Once  x2,.-.,xn  are 
known,  we  have  as  before  xx  =  —{anx2^  •••  +  a\nxn)/a\\  from  the  first 
equation  of  (3).  This  establishes  Lemma  1  for  m  =  k+  1,  and  therefore  for 
all  m ,  by  induction.  □ 

If  a  vector  space  V  has  dimension  m ,  then  it  has  m  linearly  independent 
vectors  x\...,xw  and  every  vector  in  the  space  can  be  written  as  a  linear 
combination  of  the  m  vectors  x\x2,...,xm.  It  seems  intuitively  obvious  to 
us  in  this  case  that  there  cannot  be  more  than  m  linear  independent  vectors 
in  V.  This  is  the  content  of  Lemma  2. 


Lemma  2.  In  an  m  dimensional  space ,  any  set  of  n>m  vectors  must  be  lin¬ 
early  dependent.  In  other  words ,  the  maximum  number  of  linearly  indepen¬ 
dent  vectors  in  a  finite  dimensional  space  is  the  dimension  of  the  space. 

Proof.  Since  V  has  dimension  m,  there  exist  m  linearly  independent  vec¬ 
tors  x\x2,...,xm  which  span  V.  Let  y^yV.^yHea  set  of  n  vectors  in  V, 
with  n>m.  Since  x1,x2,...,xm  span  V,  all  the  y7  can  be  written  as  linear 
combinations  of  these  vectors.  That  is  to  say,  there  exist  constants  a0,\<i 
<  n,  1  <  j  <  m  such  that 

y1  =  tf11x1  +  tf12x2+  ...  +  almxm 


y2  =  a2  |X‘  +  a22x2  +...+  a2mxm 


V  =  a«ix'  +  a„2x2  +  •  •  •  +  a„mxm. 
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To  determine  whether  y1,y2,...,yw  are  linearly  dependent  or  linearly  inde¬ 
pendent,  we  consider  the  equation 

<’iy‘  +  c2y2+  ...+c„y"=0.  (5) 

Using  (4)  we  can  rewrite  (5)  in  the  form 

0  =  c,y1  +  c2y2+...  +  c„y” 

=  (c1a11+...  +  c„a„1)x,  +  (c,a12+...+cna„2)x2 

+  ...+(c1aim+  ...+cna„Jxm. 

This  equation  states  that  a  linear  combination  of  xl,x2,...,xm  is  zero.  Since 
x',x2,...,xm  are  linearly  independent,  all  these  coefficients  must  be  zero. 
Hence, 

CjUu  +  c2a2\  +  ...  +  cnanI  =0 


clal2  +  c2a22  +  ...  +  cna„2=0 


Clal  m  +  c2a2m  +  •  •  •  +  Cnanm  = 

Now,  observe  that  the  system  of  Equations  (6)  is  a  set  of  m  homogeneous 
linear  equations  for  n  unknowns  c,,c2,  ...,cw,  with  n>m.  By  Lemma  1, 
these  equations  have  a  nontrivial  solution.  Thus,  there  exist  constants 
q, c2, . . . , cn,  not  all  zero,  such  that  qy1  4-  q y2  4- ...  4-  cn y"  =  0.  Consequently, 
y  ^ 1 ,  y2,  •  •  • ,  Y1  are  linearly  dependent.  □ 

We  are  now  in  a  position  to  prove  Theorem  2. 

Proof  of  theorem  2.  If  n  linearly  independent  vectors  span  V,  then,  by 
the  definition  of  dimension,  dimV<«.  By  Lemma  2,  «<dimV.  Hence, 
dimV  =  A2.  □ 

Example  5.  The  dimension  of  R"  is  n  since  e1,e2,...,e'1  are  n  linearly  inde¬ 
pendent  vectors  which  span  R". 


Example  6.  Let  V  be  the  set  of  all  3  X  3  matrices 


A  = 


f  a 
a 
a 


11 

a\2 

al3 

21 

a22 

a23 

31 

an 

a33 

and  let  E^  denote  the  matrix  with  a  one  in  the  ith  row,  jth  column  and 
zeros  everywhere  else.  For  example, 


V 


0  1 

0  oj 
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To  determine  whether  these  matrices  are  linearly  dependent  or  linearly  in¬ 
dependent,  we  consider  the  equation 


3 


ij  =  1 


CyEy  =  °  = 


0 

0 


0  o' 
0  0 
0  oj 


Now,  observe  that  the  left-hand  side  of  (7)  is  the  matrix 
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0 

01 

0 
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01 
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cll 
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+  C,2 
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+  •  •  •  +  C33 
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oj 

0 
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1 
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C3l 

< 

c32 

C33 

J 

Equating  this  matrix  to  the  zero  matrix  gives  cu  =0,c12  =  0, ...,c33  =0. 
Hence  the  9  matrices  are  linearly  independent.  Moreover,  these  9 
matrices  also  span  V  since  any  matrix 


«ii 

an 

a  13' 

A  = 

«21 

a  22 

a23 

«31 

a32 

a33 

3 

can  obviously  be  written  in  the  form  A=  2j  Hence  dimV  =  9. 


Definition.  If  a  set  of  linearly  independent  vectors  span  a  vector  space  V, 
then  this  set  of  vectors  is  said  to  be  a  basis  for  V.  A  basis  may  also  be 
called  a  coordinate  system.  For  example,  the  vectors 


C  "N 

1 

roi 

[01 

0 

e1  = 

0 

0 

.0. 

,  e2  = 

i 

0 

[oj 

,  e3  = 

0 

1 

L0j 

and  e4  = 

0 

0 

L  i  J 

are  a  basis  for  R4.  If 


x 


then  x  =  x{el  +  x2e2  +  x3e3  +  x4e4,  and  relative  to  this  basis  the  xt  are 
called  “components”  or  “coordinates.” 


Corollary.  In  a  finite  dimensional  vector  space ,  each  basis  has  the  same  num¬ 
ber  of  vectors ,  and  this  number  is  the  dimension  of  the  space. 


The  following  theorem  is  extremely  useful  in  determining  whether  a  set 
of  vectors  is  a  basis  for  V. 
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Theorem  3.  Any  n  linearly  independent  vectors  in  an  n  dimensional  space  V 
must  also  span  V.  That  is  to  say ,  any  n  linearly  independent  vectors  in  an  n 
dimensional  space  V  are  a  basis  for  V. 

Proof.  Let  x1,x2,...,x'*  be  n  linearly  independent  vectors  in  an  n  dimen¬ 
sional  space  V.  To  show  that  they  span  V,  we  must  show  that  every  x  in  V 
can  be  written  as  a  linear  combination  of  x1,x2,...,x".  To  this  end,  pick 
any  x  in  V  and  consider  the  set  of  vectors  x,x1,x2,...,x'\  This  is  a  set  of 
{n+  1)  vectors  in  the  n  dimensional  space  V;  by  Lemma  2,  they  must  be 
linearly  dependent.  Consequently,  there  exist  constants  c,c1,c2,...,c/I,  not 
all  zero,  such  that 

cx+c1x1  +  c2x2+  ...  +c„x'I  =  0.  (8) 

Now  c- t^O,  for  otherwise  the  set  of  vectors  x1,x2,...,x'1  would  be  linearly 
dependent.  Therefore,  we  can  divide  both  sides  of  (8)  by  c  to  obtain  that 


Hence,  any  n  linearly  independent  vectors  in  an  n  dimensional  space  V 
must  also  span  V.  □ 

Example  7.  Prove  that  the  vectors 

*'-(])  and  *■-(_!) 


form  a  basis  for  R2. 

Solution.  To  determine  whether  x1  and  x2  are  linearly  dependent  or  lin¬ 
early  independent,  we  consider  the  equation 

c,x1  +  c2x2  =  c,(  J  )  +  c2(  _[  )  =  (q)-  (9) 

Equation  (9)  implies  that  cx  +  c2  =  0  and  c,  —  c2  =  0.  Adding  these  two  equa¬ 
tions  gives  cx  =0  while  subtracting  these  two  equations  gives  c2  =  0.  Conse¬ 
quently,  x1  and  x2  are  two  linearly  independent  vectors  in  the  two  dimen¬ 
sional  space  R2.  Hence,  by  Theorem  3,  they  must  also  span  V. 

Exercises 

In  each  of  Exercises  1-4,  determine  whether  the  given  set  of  vectors  is  lin¬ 
early  dependent  or  linearly  independent. 
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—  1  I  and 


4.  2  , 

V  6  / 


-'D- 


-n 

0  and 
\ 


5.  Let  V  be  the  set  of  all  2x2  matrices.  Determine  whether  the  following  sets  of 
matrices  are  linearly  dependent  or  linearly  independent  in  V. 

«(!  X>  (i  ?>  (1  i)  -  (X  !) 

®c  s>  (i  :)•  (?  i)  -  (.?  -?> 

6.  Let  V  be  the  space  of  all  polynomials  in  t  of  degree  <  2. 

(a)  Show  that  dim  V  =  3. 

(b)  Let  pl9  p2  and  p3  be  the  three  polynomials  whose  values  at  any  time  /  are 
(/-  l)2,  (/  —  2)2,  and  (/  —  1)  (t  —  2)  respectively.  Show  that p\,p2 ,  and  p3  are 
linearly  independent.  Hence,  conclude  from  Theorem  3  that  ph  p2,  and  p3 
form  a  basis  for  V. 

7.  Let  V  be  the  set  of  all  solutions  of  the  differential  equation  d2y / dt 2  —y  =  0. 

(a)  Show  that  V  is  a  vector  space. 

(b)  Find  a  basis  for  V. 

8.  Let  V  be  the  set  of  all  solutions  of  the  differential  equation  (d3y/dt3)+y=  0 
which  satisfy  /(0)  =  0.  Show  that  V  is  a  vector  space  and  find  a  basis  for  it. 

9.  Let  V  be  the  set  of  all  polynomials p(t)  =  a0  +  axt  +  a2t2  which  satisfy 

p  (0)  +  2p'(0)  -I-  3p"(0)  =  0. 

Show  that  V  is  a  vector  space  and  find  a  basis  for  it. 


10.  Let  V  be  the  set  of  all  solutions 


of  the  differential  equation 


x,(0 

x2(t) 

*3(0 


1  °\ 

0  1  |x. 

-II  6/ 


Show  that 


x>(0  = 


el  5 


e  e 

x2(0=  2e2’  ,  and  x3(0=  3e3' 

4e2'  9e3' 


form  a  basis  for  V. 
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11.  Let  V  be  a  vector  space.  We  say  that  W  is  a  subspace  of  V  if  W  is  a  subset  of  V 
which  is  itself  a  vector  space.  Let  W  be  the  subset  of  R3  which  consists  of  all 
vectors 


*i\ 
x 2 


which  satisfy  the  equations 


x\  +  x2  +  2x3  =  0 
2xx  —  X2  +  *3  =  0 
6xj  +6x3  =  0. 


Show  that  W  is  a  subspace  of  R3  and  find  a  basis  for  it. 

12.  Prove  that  any  n  vectors  which  span  an  n  dimensional  vector  space  V  must  be 
linearly  independent.  Hint :  Show  that  any  set  of  linearly  dependent  vectors 
contains  a  linearly  independent  subset  which  also  spans  V. 

13.  Let  v^v2,...,^  be  n  vectors  in  a  vector  space  V.  Let  W  be  the  subset  of  V 
which  consists  of  all  linear  combinations  c1v1  +  c2v2+  ...  +cnvn  of  v^v2,...,^. 
Show  that  W  is  a  subspace  of  V,  and  that  dimW  <  n. 

14.  Let  V  be  the  set  of  all  functions  f(t)  which  are  analytic  for  |f|  <  1,  that  is,  /(/) 
has  a  power  series  expansion / (/)  =  a0+  a\t  +  a2t2  +  ...  which  converges  for  |/| 
<  1.  Show  that  V  is  a  vector  space,  and  that  its  dimension  is  infinite.  Hint :  V 
contains  all  polynomials. 

15.  Let  v1^2,...,^  be  m  linearly  independent  vectors  in  an  n  dimensional  vector 
space  V,  with  n>m.  Show  that  we  can  find  vectors  vm+1,...,vn  so  that 
v1,v2,...,vw,vm+1,...,v'1  form  a  basis  for  V.  That  is  to  say,  any  set  of  m  linearly 
independent  vectors  in  an  n  >  m  dimensional  space  V  can  be  completed  to 
form  a  basis  for  V. 


16.  Find  a  basis  for  R3  which  includes  the  vectors 


M 

1  I  and 

0/ 


I- 


17.  (a)  Show  that 


v 


1 

0 

0 


and  v3  = - 

V2 


are  linearly  independent  in  R3. 
(b)  Let 


x  = 


x\  \ 

x2  I  =  x1e1  +  x2e2  +  x3e3. 
*3/ 
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Since  v^v2,  and  v3  are  linearly  independent  they  are  a  basis  and  x=y1v1  + 
y2\2  +y3v3.  What  is  the  relationship  between  the  original  coordinates  x(  and 
the  new  coordinates  yp 

(c)  Express  the  relations  between  coordinates  in  the  form  x  =  By.  Show  that  the 
columns  of  B  are  v1,  v2,  and  v3. 


3.4  Applications  of  linear  algebra 
to  differential  equations 

Recall  that  an  important  tool  in  solving  the  second-order  linear  homoge¬ 
neous  equation  (d2y  /  dt2)+ p(t){dy  /  dt)+ q{t)y  =  0  was  the  existence- 
uniqueness  theorem  stated  in  Section  2.1.  In  a  similar  manner,  we  will 
make  extensive  use  of  Theorem  4  below  in  solving  the  homogeneous  linear 
system  of  differential  equations 


dx 

dt 


•V 

f  „ 

Q 11 

•  •  • 

(*\n 

=  Ax, 

x  = 

• 

• 

0 

<  J 

,  A  = 

f - 

... 

«3 

•  •  • 

0 

0 

0 

®nn 

The  proof  of  this  theorem  will  be  indicated  in  Section  4.6. 


Theorem  4  (Existence-uniqueness  theorem).  There  exists  one ,  and  only 
one ,  solution  of  the  initial-value  problem 


Moreover ,  this  solution  exists  for  —  oo  <  t  <  oo. 


Theorem  4  is  an  extremely  powerful  theorem,  and  has  many  implica¬ 
tions.  In  particular,  if  x(/)  is  a  nontrivial  solution,  then  x(/)^=0  for  any  t. 
(If  x(/*)  =  0  for  some  t*9  then  x(/)  must  be  identically  zero,  since  it,  and  the 
trivial  solution,  satisfy  the  same  differential  equation  and  have  the  same 
value  at  t  =  t*.) 

We  have  already  shown  (see  Example  7,  Section  3.2)  that  the  space  V  of 
all  solutions  of  (1)  is  a  vector  space.  Our  next  step  is  to  determine  the  di¬ 
mension  of  V. 


Theorem  5.  The  dimension  of  the  space  V  of  all  solutions  of  the  homogeneous 
linear  system  of  differential  equations  (1)  is  n. 

Proof.  We  will  exhibit  a  basis  for  V  which  contains  n  elements.  To  this 
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end,  let  $J(t)9j=l9...9nbe  the  solution  of  the  initial-value  problem 


r 


dx 

dt 


0 

1 

0 


— j th  row. 


0 


For  example,  ^>\t)  is  the  solution  of  the  differential  equation  (1)  which 
satisfies  the  initial  condition 


^I(0)  =  eI  = 


Note  from  Theorem  4  that  <f >J(t)  exists  for  all  t  and  is  unique.  To  de¬ 
termine  whether  are  linearly  dependent  or  linearly  indepen¬ 

dent  vectors  in  V,  we  consider  the  equation 

+  •••  =  ®  (4) 

where  the  zero  on  the  right-hand  side  of  (4)  stands  for  the  zero  vector  in  V 
(that  is,  the  vector  whose  every  component  is  the  zero  function).  We  want 
to  show  that  (4)  implies  cx  =  c2  =  . . .  =  cn  =  0.  Evaluating  both  sides  of  (4)  at 
t  =  0  gives 


1 

0 

0 


c\ V  (0)  +  ^2(0)  +  •  •  •  +  cn§n  (0)  = 


0 

0 


0 


or 


cxel  +  c2e2  +  . . .  4-  cnen  =  0. 

Since  we  know  that  e!,e2, ...,e"  are  linearly  independent  in  R",  cx  =  c2 
==  ...  =cw  =  0.  We  conclude,  therefore,  that  ^>1,^2,...,^w  are  linearly  inde¬ 
pendent  vectors  in  V. 

Next,  we  claim  that  tfr1,#2,...,#”  also  span  V.  To  prove  this,  we  must 
show  that  any  vector  x  in  V  (that  is,  any  solution  x(/)  of  (1))  can  be  written 
as  a  linear  combination  of  ^>1,^2,...,^w.  To  this  end,  pick  any  x  in  V,  and 
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let 


c  = 


be  the  value  of  x  at  /  =  0  (x(0)  =  c).  With  these  constants  cl9c2,...9cn9  con¬ 
struct  the  vector-valued  function 

^(0==ci^1(0  +  c2^2(0+  •••  +^$"(0' 


We  know  that  ^(/)  satisfies  (1)  since  it  is  a  linear  combination  of  solutions. 
Moreover, 

£(0)  =  c,^1  (0)  +  c2<!>2(  0)  +  . . .  +  cnr  (0) 


Now,  observe  that  x(/)  and  <f>(t)  satisfy  the  same  homogeneous  linear  sys¬ 
tem  of  differential  equations,  and  that  x(/)  and  <j>(t)  have  the  same  value  at 
/  =  0.  Consequently,  by  Theorem  4,  x(/)  and  ^(/)  must  be  identical,  that  is 

x(/)=^(/)  =  Ci^1(/)H-c2^2(/)h-  ...  +  cn^n{t). 

Thus,  also  span  V.  Therefore,  by  Theorem  2  of  Section  3.3, 

dimV  =  «.  □ 


Theorem  5  states  that  the  space  V  of  all  solutions  of  (1)  has  dimension 
n.  Hence,  we  need  only  guess,  or  by  some  means  find,  n  linearly  indepen¬ 
dent  solutions  of  (1).  Theorem  6  below  establishes  a  test  for  linear  indepen¬ 
dence  of  solutions.  It  reduces  the  problem  of  determining  whether  n  solu¬ 
tions  x1,x2,...,x/I  are  linearly  independent  to  the  much  simpler  problem  of 
determining  whether  their  values  x1(f0),x2(f0),...,x',(^0)  at  an  appropriate 
time  t0  are  linearly  independent  vectors  in  R". 

Theorem  6  (Test  for  linear  independence).  Let  x\x2,...,x*  be  k  solutions 
of  x  =  Ax.  Select  a  convenient  t0.  Then ,  x\...,x*  are  linear  independent 
solutions  if  and  only  if  xl(t0),x2(t0),...,xk(t0)  are  linearly  independent 
vectors  in  R". 


Proof.  Suppose  that  x\x2,...,x*  are  linearly  dependent  solutions.  Then, 
there  exist  constants  cX9cl9...9ck9  not  all  zero,  such  that 

cxxl  +  c2x?‘+  ...  +  c*x/c  =  0. 
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Evaluating  this  equation  at  t  =  t0  gives 


Cix'(/0)  +  c2x200)  +  . . .  +  ckxk(t0)  = 


Hence  x1(/0),x2(/0),...,xA:(/0)  are  linearly  dependent  vectors  in  Rn. 

Conversely,  suppose  that  the  values  of  x^x2, at  some  time  t0  are 
linearly  dependent  vectors  in  R".  Then,  there  exist  constants  cx,c2,...,ck, 
not  all  zero,  such  that 


0 

0 

0 


Cjx'^o)  +  C2X2(/0)  +  . . .  +  Ckxk(t0 )  = 


With  this  choice  of  constants  cx,c2,...,ck,  construct  the  vector-valued  func¬ 
tion 

<£(f)  =  CxXl(t)  +  c2x2(7)  +  ...  +ckxk(t). 

This  function  satisfies  (1)  since  it  is  a  linear  combination  of  solutions. 
Moreover,  ^(^o)  =  0.  Hence,  by  Theorem  4,  ^>(/)  =  0  for  all  t .  This  implies 
that  x1,  x2, . . . ,  x*  are  linearly  dependent  solutions.  □ 


Example  1.  Consider  the  system  of  differential  equations 
dxx 

-(_?  -ib  *-(£)• 


d,  -*> 


dx2 

-^-  =  -xx-lx2 


dx 

dt 


This  system  of  equations  arose  from  the  single  second-order  equation 

d2y  ^ dy 
-4+2-t- +>>  =  0 
dt 2  dt 


(5) 


(6) 


by  setting  xx  =y  and  x2  =  dy /dt.  Since  yx(t)  =  e  1  and  y2(t)  =  te  1  are  two 
solutions  of  (6),  we  see  that 


- 1 


—  e 


and 


te  1 

(1  ~t)e- 
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are  two  solutions  of  (5).  To  determine  whether  x1  and  x2  are  linearly  de¬ 
pendent  or  linearly  independent,  we  check  whether  their  initial  values 

x>(0)  =  (_|) 

and 


are  linearly  dependent  or  linearly  independent  vectors  in  R2.  Thus,  we  con¬ 
sider  the  equation 


c,x,(0)  +  c2x2(0)  =  (  _c^Vc2)  =  (o)- 

This  equation  implies  that  both  cx  and  c2  are  zero.  Hence,  x!(0)  and  x2(0) 
are  linearly  independent  vectors  in  R2.  Consequently,  by  Theorem  6,  x1^) 
and  x2(t)  are  linearly  independent  solutions  of  (5),  and  every  solution  x(7) 
of  (5)  can  be  written  in  the  form 


\cx  +  c2t)e  '  \ 

,(c2-cl-c2t)e~t ) 


Example  2.  Solve  the  initial-value  problem 


dx  /  0 

dt  l  —  1 


1 


x, 


Solution.  From  Example  1,  every  solution  x(t)  must  be  of  the  form  (7).  The 
constants  cx  and  c2  are  determined  from  the  initial  conditions 


Therefore,  cx  =  1  and  c2  =  1  +  c,  =  2.  Hence 


(1+2 t)e  l\ 
(1-2/)*-'] 


Up  to  this  point  in  studying  (1)  we  have  found  the  concepts  of  linear  al¬ 
gebra  such  as  vector  space,  dependence,  dimension,  basis,  etc.,  and  vector- 
matrix  notation  useful,  but  we  might  well  ask  is  all  this  other  than  simply 
an  appropriate  and  convenient  language.  If  it  were  nothing  else  it  would  be 
worth  introducing.  Good  notations  are  important  in  expressing  mathemati¬ 
cal  ideas.  However,  it  is  more.  It  is  a  body  of  theory  with  many  applica¬ 
tions. 
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In  Sections  3.8-3.10  we  will  reduce  the  problem  of  finding  all  solutions 
of  (1)  to  the  much  simpler  algebraic  problem  of  solving  simultaneous  linear 
equations  of  the  form 


a\  1*1  +  a  12*2  +  •  •  •  +  aXn  xn  =  Z>, 
a2\x\^~  a22x2'^'  •••+  02„Xn  =  £>2 

an\x\  +  a„2*2  +  •  •  •  +  annxn  =  b„ 

Therefore,  we  will  now  digress  to  study  the  theory  of  simultaneous  linear 
equations.  Here  too  we  will  see  the  role  played  by  linear  algebra. 

Exercises 

In  each  of  Exercises  1-4  find  a  basis  for  the  set  of  solutions  of  the  given 
differential  equation. 

.  x  =  ^  J  (Hint:  Find  a  second-order  differential  equation  satis¬ 

fied  by  xj(O-) 


1 


0  1  0 

2.  x  =  |  0  0  1  |x 

2  -12 
fied  by  x j(7).) 


(Hint:  Find  a  third-order  differential  equation  satis- 


‘•*=(2  ?): 


1  0  0 

4.  x=|  1  1  0  |x 

1  1  1 


For  each  of  the  differential  equations  5-9  determine  whether  the  given 
solutions  are  a  basis  for  the  set  of  all  solutions. 


•*=(_?  J)*;  x'w=(_'!)’x2«=(^',) 


(  4 

-2  2\ 

■  0  ’ 

r  , 

e6' 

“Vi 

3  1  x;  x\t)  = 

-1  5/ 

e2t 

0 

c  j 

,  x2(0  = 

e4' 

[e4'J 

,  x3(0  = 

0 

e6' 

^  J 

7.  x  = 


3  -2  3  \ 

k2'l 

f - 

(N 

v _ 

r 

1 

1 

_ 

1  0  -3  x;  x‘(/)  = 

1  -2  -if 

e“2' 

e~2t 

< w  j 

.  x2(0  = 

—  e2' 
e2‘ 

K.  W  J 

II 

e  4' 
c"4' 

L  W  J 

8.  x  = 


5 

1 

1 


2 

4 

1 


2 

1  |x; 
6 


x'(/)  = 


re-3n 

e-3' 

0 


,  x2(0  = 


0 

-5/ 

-St 
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10.  Determine  the  solutions  (see  proof  of  Theorem  5)  for  the  system 

of  differential  equations  in  (a)  Problem  5;  (b)  Problem  6;  (c)  Problem  7. 


11.  Let  V  be  the  vector  space  of  all  continuous  functions  on  (—00,00)  to  R"  (the 
values  of  x(/)  lie  in  R").  Let  x1,x2,...,x/I  be  functions  in  V. 

(a)  Show  that  x1(/0),---,x',(/0)  linearly  independent  vectors  in  Rrt  for  some  t0 
implies  x1,x2,...,x'1  are  linearly  independent  functions  in  V. 

(b)  Is  it  true  that  xl(t0),...,xn(t0)  linearly  dependent  in  R"  for  some  t0  implies 
x!,x2, ...,xn  are  linearly  dependent  functions  in  V?  Justify  your  answer. 


12.  Let  u  be  a  vector  in  R"  (u^O). 

(a)  Is  x(/)  =  tu  a  solution  of  a  linear  homogeneous  differential  equation  x= Ax? 

(b)  Is  x(/)  =  eXtul  (c)  Is  x(f)  =  (e '  —  e  ~  ')u? 

(d)  Is  x(0=(e'  +  e-/)u?  (e)  Is  x(/)=(eAl/4-eX2/)u? 

(f)  For  what  functions  </>(/)  can  x(t)  =  <f>(t)u  be  a  solution  of  some  x=Ax? 


3.5  The  theory  of  determinants 

In  this  section  we  will  study  simultaneous  equations  of  the  form 

axxxx  +  aX2x  2  +  •••  +  alnxn  =  bx 

a2lxl  +  a22x2  +  . . .  +  a2nxn  =  b2 

0) 

•  • 

•  • 

«„!*!  +  an2X2  +  •  •  •  +  OnnX„  =  b„. 

Our  goal  is  to  determine  a  necessary  and  sufficient  condition  for  the  sys¬ 
tem  of  equations  (1)  to  have  a  unique  solution  xvx2,...,xn. 

To  gain  some  insight  into  this  problem,  we  begin  with  the  simplest  case 
n  =  2.  If  we  multiply  the  first  equation  anxx  +  aX2x2  =  bx  by  a2V  the  second 
equation  a2xxx  +  a22x2  —  b2  by  axx,  and  then  subtract  the  former  from  the 
latter,  we  obtain  that 

(^11^22  “  ^12^2l)-*2  “  a\\^2  ~~  ^21^1* 

Similarly,  if  we  multiply  the  first  equation  by  a22,  the  second  equation  by 
aX2 ,  and  then  subtract  the  latter  from  the  former,  we  obtain  that 

(^11^22  a\2a2\)X\  =  ^22^1  ”  a\2^2* 

Consequently,  the  system  of  equations  (1)  has  a  unique  solution 

a22^l~a\2^2  aU^2~~  a2\b\ 

a\\a22~~  a\2a2\  2  a\\a22~~  aUa2\ 
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if  the  number  ana22  —  ^12^21  *s  unequal  to  zero.  If  this  number  equals  zero, 
then  we  may,  or  may  not,  have  a  solution.  For  example,  the  system  of 
equations 


xl-x2 


=  1, 


2x{  —  2x2  =  4 


obviously  has  no  solutions,  while  the  system  of  equations 

Xj  — x2  =  0,  2*! -2*2  =  0 

has  an  infinity  of  solutions  xx  =  c,x2  =  c  for  any  number  c.  For  both  these 
systems  of  equations, 

a\\a22~  ana2\=:  1(“ 2)  —  (—  1)2  =  0. 

The  case  of  three  equations 

4"  ClX2X2  ^13^3  “  ^1 

a2\X\  a22X2  a23X3  =  ^ 2  (2) 

#31*1  "4"  0,y2x2  4"  Cl-33x3  ^3 

in  three  unknowns  xvx2,x3  can  also  be  handled  quite  easily.  By  eliminat¬ 
ing  one  of  the  variables  from  two  of  the  equations  (2),  and  thus  reducing 
ourselves  to  the  case  «  =  2,  it  is  possible  to  show  (see  Exercise  1)  that  the 
system  of  equations  (2)  has  a  unique  solution  xx,x2,x3  if,  and  only  if,  the 
number 


1^22^33  +  ^12^23^31  +  al  3^21^32  a\3a22a3\  a\2a2\a33  a\\a23a32  0) 

is  unequal  to  zero. 

We  now  suspect  that  the  system  of  equations  (1),  which  we  abbreviate 
in  the  form 


f - 

• 

• 

• 

V _ 

'X,  ' 

- ‘N 

J 

Ax  =  b,  A  = 

•  • 

•  • 

•  • 

anX  ...  Qnn 

,  x  = 

A 

,  b= 

e - 

V _ 

has  a  unique  solution  x  if,  and  only  if,  a  certain  number,  which  depends  on 
the  elements  atj  of  the  matrix  A,  is  unequal  to  zero.  We  can  determine  this 
number  for  n  —  4  by  eliminating  one  of  the  variables  from  two  of  the  equa¬ 
tions  (1).  However,  the  algebra  is  so  complex  that  the  resulting  number  is 
unintelligible.  Instead,  we  will  generalize  the  number  (3)  so  as  to  associate 
with  each  system  of  equations  Ax  =  b  a  single  number  called  determinant  A 
(detA  for  short),  which  depends  on  the  elements  of  the  matrix  A.  We  will 
establish  several  useful  properties  of  this  association,  and  then  use  these 
properties  to  show  that  the  system  of  equations  (4)  has  a  unique  solution  x 
if,  and  only  if,  detA^O. 

If  we  carefully  analyze  the  number  (3),  we  see  that  it  can  be  described  in 
the  following  interesting  manner.  First,  we  pick  an  element  aXj  from  the 
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first  row  of  the  matrix 

a\\  a\2  a\3 

A=  a2l  a22  a23  • 

a3l  a  32  a33 

This  element  can  be  either  axx,ax2 ,  or  al3.  Then,  we  multiply  aXJ j  by  an  ele¬ 
ment  alj2  from  the  second  row  of  A.  However,  j2  must  not  equal  jx.  For  ex¬ 
ample,  if  we  choose  aX2  from  the  first  row  of  A,  then  we  must  choose  either 
a2l  or  a23  from  the  second  row  of  A.  Next,  we  multiply  these  two  numbers 
by  the  element  in  the  third  row  of  A  in  the  remaining  column.  We  do  this 
for  all  possible  choices  of  picking  one  element  from  each  row  of  A,  never 
picking  from  the  same  column  twice.  In  this  manner,  we  obtain  6  different 
products  of  three  elements  of  A,  since  there  are  three  ways  of  choosing  an 
element  from  the  first  row  of  A,  then  two  ways  of  picking  an  element  from 
the  second  row  of  A,  and  then  only  one  way  of  choosing  an  element  from 
the  third  row  of  A.  Each  of  these  products  ^ij^2j2a3j3  multiplied  by  ±  1, 
depending  on  the  specific  order  jxj2j3-  The  products  ax j^2j2a3j3  with 
0*172^*3) =  (123),  (231),  and  (312)  are  multiplied  by  +  1,  while  the  products 
a\j02j2a3j3  with  (717*273)“  (321),  (213),  and  (132)  are  multiplied  by  —  1.  Fi¬ 
nally,  the  resulting  numbers  are  added  together. 

The  six  sets  of  numbers  (123),  (231),  (312),  (321),  (213),  and  (132)  are 
called  permutations ,  or  scramblings ,  of  the  integers  1,  2,  and  3.  Observe  that 
each  of  the  three  permutations  corresponding  to  the  plus  terms  requires  an 
even  number  of  interchanges  of  adjacent  integers  to  unscramble  the  per¬ 
mutation,  that  is,  to  bring  the  integers  back  to  their  natural  order.  Simi¬ 
larly,  each  of  the  three  permutations  corresponding  to  the  minus  terms  re¬ 
quires  an  odd  number  of  interchanges  of  adjacent  integers  to  unscramble 
the  permutation.  To  verify  this,  observe  that 

23 1  — >2 1 3  — >  1 23  (2  interchanges) 

3 12 — >132 — >123  (2  interchanges) 

32 1  — >3 12 — >132 — >123  (3  interchanges) 

213 — >123  and  132 — >123  (1  interchange  each). 

This  motivates  the  following  definition  of  the  determinant  of  an  n  X  n 
matrix  A. 


Definition. 


det  A  = 


Ej\h-~jnaV\aV2'  '  Mnjn 


where  e )jj  =  1  if  the  permutation  (Jdi- •  Jn)  *s  even,  that  is,  if  we  can 
bring  the  integers  jx . .  ,jn  back  to  their  natural  order  by  an  even  number 
of  interchanges  of  adjacent  integers,  and  e,-  .  =  —  1  if  the  permutation 

•  Jn)  *s  °dd.  In  other  words,  pick  an  element  aXJi  from  the  first  row 
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of  the  matrix  A.  Then,  multiply  it  by  an  element  a2ji  from  the  second 
row  of  A,  with  j2=£jx.  Continue  this  process,  going  from  row  to  row,  and 
always  picking  from  a  new  column.  Finally,  multiply  the  product 
a\j\a2j1"Mnjn  by  +  1  if  the  permutation  (Jx j2.-Jn)  is  even  and  by  —  1  if 
the  permutation  is  odd.  Do  this  for  all  possible  choices  of  picking  one 
element  from  each  row  of  A,  never  picking  from  the  same  column  twice. 
Then,  add  up  all  these  contributions  and  denote  the  resulting  number 
by  detA. 


Remark.  There  are  many  different  ways  of  bringing  a  permutation  of  the 
integers  1,2, back  to  their  natural  order  by  successive  interchanges  of 
adjacent  integers.  For  example, 

43 1 2-+4 1 32->  1 432 ->  1 423-+ 1 243-+ 1 234 
and 

43 1 2-+34 1 2-+3 1 42-+3 1 24-+ 1 324-* 1 234. 

However,  it  can  be  shown  that  the  number  of  interchanges  of  adjacent  in¬ 
tegers  necessary  to  unscramble  the  permuation  jx . .  Jn  is  always  odd  or  al¬ 
ways  even.  Hence  j  is  perfectly  well  defined. 

Example  1.  Let 

A  =  (*n  anX2\ 

\  “21  “22  ) 

In  this  case,  there  are  only  two  products  axxa2 2  and  ax2a2 x  that  enter  into 
the  definition  of  detA.  Since  the  permutation  (12)  is  even,  and  the  per¬ 
mutation  (21)  is  odd,  the  term  axxa22  is  multiplied  by  +1  and  the  term 
aX2a2X  is  multiplied  by  -  1.  Hence,  detA=  axxa22- aX2a2X. 


Example  2.  Compute 


1  *1 
2  1 

-1  2. 


Solution.  A  shorthand  method  for  computing  the  determinant  of  a  3x3 
matrix  is  to  write  the  first  two  columns  after  the  matrix  and  then  take 
products  along  the  diagonals  as  shown  below. 


i 

1 

r 

K 

sk 

\'  \' 

y 

det 

3 

2 

i 

=  3 

X2x 

\,/X3  7 
x'1  \  \ 

2 

-1 

-1 

2. 

-lx 

X2  -1  - 
"■+  N- 

=  1-2-2+  1- 1(-  1)+  l-3(-  1) - (— 1)-2- 1  —  (— 1)- 1  - 1-2-3- 1  =  -3. 

If  A  is  an  n  X  n  matrix,  then  detA  will  contain,  in  general,  n\  products  of 
n  elements.  The  determinant  of  a  4x4  matrix  contains,  in  general,  24 
terms,  while  the  determinant  of  a  10  X  10  matrix  contains  the  unacceptably 
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large  figure  of  3,628,800  terms.  Thus,  it  is  practically  impossible  to  com¬ 
pute  the  determinant  of  a  large  matrix  A  using  the  definition  (5)  alone.  The 
smart  way,  and  the  only  way,  to  compute  determinants  is  to  (i)  find  special 
matrices  whose  determinants  are  easy  to  compute,  and  (ii)  reduce  the  prob¬ 
lem  of  finding  any  determinant  to  the  much  simpler  problem  of  computing 
the  determinant  of  one  of  these  special  matrices.  To  this  end,  observe  that 
there  are  three  special  classes  of  matrices  whose  determinants  are  trivial  to 
compute. 


1.  Diagonal  matrices :  A  matrix 

au  0  0  ...  0 

0  a12  0  ...  0 

A  = 

•  •  •  •  y 

•  •  •  • 

•  •  •  • 

0  0  0  ...  ann 

V  J 

whose  nondiagonal  elements  are  all  zero,  is  called  a  diagonal  matrix.  Its 
determinant  is  the  product  of  the  diagonal  elements  an,a22, This 
follows  immediately  from  the  observation  that  the  only  way  we  can  choose 
a  nonzero  element  from  the  first  row  of  A  is  to  pick  an.  Similarly,  the  only 
way  we  can  choose  a  nonzero  element  from  the  jth  row  of  A  is  to  pick 
Thus  the  only  nonzero  product  entering  into  the  definition  of  detA  is 
ona22...ann,  and  this  term  is  multiplied  by  +1  since  the  permutation 
(12... «)  is  even. 


2.  Lower  diagonal  matrices :  A  matrix 

an  0  ...  0 

a2\  a22  •  •  •  0 

A  = 

•  •  • 

•  •  • 

•  •  • 

anl  an2  •**  ann 

V.  > 

whose  elements  above  the  main  diagonal  are  all  zero,  is  called  a  lower  di¬ 
agonal  matrix,  and  its  determinant  too  is  the  product  of  the  diagonal  ele¬ 
ments  au, . . .  ,ann.  To  prove  this,  observe  that  the  only  way  we  can  choose  a 
nonzero  element  from  the  first  row  of  A  is  to  pick  an.  The  second  row  of  A 
has  two  nonzero  elements,  but  since  we  have  already  chosen  from  the  first 
column,  we  are  forced  to  pick  a22  from  the  second  row.  Similarly,  we  are 
forced  to  pick  from  the yth  row  of  A.  Thus,  detA  =  a\\a22...ann. 


3.  Upper  diagonal  matrices :  A  matrix 


j 
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whose  elements  below  the  main  diagonal  are  all  zero  is  called  an  upper  di¬ 
agonal  matrix  and  its  determinant  too  is  the  product  of  the  diagonal  ele¬ 
ments  au,...,ann.  To  prove  this,  we  proceed  backwards.  The  only  way  we 
can  choose  a  nonzero  element  from  the  last  row  of  A  is  to  pick  ann .  This 
then  forces  us  to  pick  an_X  n_x  from  the  (n  —  l)st  row  of  A.  Similarly,  we 
are  forced  to  pick  from  the yth  row  of  A.  Hence  det  A  =  n22aii- 

We  now  derive  some  simple  but  extremely  useful  properties  of  determi¬ 
nants. 


Property  1.  If  we  interchange  two  adjacent  rows  of  A,  then  we  change  the 
sign  of  its  determinant. 


Proof.  Let  B  be  the  matrix  obtained  from  A  by  interchanging  the  kih  and 
(k  4-  l)st  rows.  Observe  that  all  of  the  products  entering  into  the  definition 
of  detB  are  exactly  the  same  as  the  products  entering  into  the  definition  of 
detA.  The  only  difference  is  that  the  order  in  which  we  choose  from  the 
columns  of  A  and  B  is  changed.  For  example,  let 


and 


3  4 

2  -1 
-1  0. 


The  product  4x2x2  appears  in  detA  by  choosing  first  from  the  first  row, 
third  column;  then  from  the  second  row,  first  column;  and  finally  from  the 
third  row,  second  column.  This  same  product  appears  in  detB  by  choosing 
first  from  the  first  row,  third  column;  then  from  the  second  row,  second 
column;  and  finally  from  the  third  row,  first  column.  More  generally,  the 
term 


a 


i/i  • 


Mkjkak+\Jk+l 


in  detA  corresponds  to  the  term 

b\ i\  *  •  *  ^ 7kjk+fik + i  jk  •  •  •  bnjn 

in  detB.  The  sign  of  the  first  term  is  determined  by  the  permutation 
U\--jkJk+\"Jn)  while  the  sign  of  the  second  term  is  determined  by  the 
permutation  (Ji--Jk+\jk--Jn)-  Since  the  second  permutation  is  obtained 
from  the  first  by  interchanging  the  kth  and  (k+  l)st  elements,  we  see  that 
these  two  terms  have  opposite  signs.  Hence  detB=  —  detA.  □ 


Property  2.  If  we  interchange  any  two  rows  of  A,  then  we  change  the  sign 
of  its  determinant. 
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Proof.  We  will  show  that  the  number  of  interchanges  of  adjacent  rows  re¬ 
quired  to  interchange  the  ith  and  yth  rows  of  A  is  odd.  Property  2  will  then 
follow  immediately  from  Property  1.  To  this  end,  assume  that  j  is  greater 
than  /.  We  need  j—i  successive  interchanges  of  adjacent  rows  to  get  the  jth 
row  into  the  ith  place,  and  then  j  —  i  —  1  successive  interchanges  of  adjacent 
rows  to  get  the  original  ith  row  into  the yth  place.  Thus  the  total  number  of 
interchanges  required  is  2 {j  —  i)  —  1,  and  this  number  is  always  odd.  □ 

Property  3.  If  any  two  rows  of  A  are  equal,  then  detA  =  0. 

Proof.  Let  the  ith  and yth  rows  of  A  be  equal,  and  let  B  be  the  matrix  ob¬ 
tained  from  A  by  interchanging  its  ith  and yth  rows.  Obviously,  B  =  A.  But 
Property  2  states  that  detB=  —  detA.  Hence,  detA=  —  detA  if  two  rows  of 
A  are  equal,  and  this  is  possible  only  if  detA  =  0.  □ 

Property  4.  det cA  =  cn det A. 

Proof.  Obvious.  □ 

Property  5.  Let  B  be  the  matrix  obtained  from  A  by  multiplying  its  ith  row 
by  a  constant  c.  Then,  detB  =  cdetA. 

Proof.  Obvious.  □ 

Property  6.  Let  AT  be  the  matrix  obtained  from  A  by  switching  rows  and 
columns.  The  matrix  Ar  is  called  the  transpose  of  A.  For  example,  if 


A  = 

1 

6 

3  2' 
9  4 

,  then  At= 

\l  6 

3  9 

-n 

2 

-1 

2  7. 

.2  4 

7. 

A  concise  way  of  saying  this  is  (A T)iJ  =  aji.  Then, 

det  AT= detA. 

Proof.  It  is  clear  that  all  the  products  entering  into  the  definition  of  detA 
and  detAr  are  equal,  since  we  always  choose  an  element  from  each  row 
and  each  column.  The  proof  that  these  products  have  the  same  sign, 
though,  is  rather  difficult,  and  will  not  be  included  here.  (Frankly, 
whenever  this  author  teaches  determinants  to  his  students  he  wishes  that 
he  were  king,  so  that  he  could  declare  Property  6  true  by  edict.)  □ 

Remark.  It  follows  immediately  from  Properties  2,  3,  and  6  that  we  change 
the  sign  of  the  determinant  when  we  interchange  two  columns  of  A,  and 
that  detA  =  0  if  two  columns  of  A  are  equal. 

Property  7.  If  we  add  a  multiple  of  one  row  of  A  to  another  row  of  A,  then 
we  do  not  change  the  value  of  its  determinant. 
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Proof.  We  first  make  the  crucial  observation  that  detA  is  a  linear  func¬ 
tion  of  each  row  of  A  separately.  By  this  we  mean  the  following.  Write  the 
matrix  A  in  the  form 


a,  i  a, 2 

...  a^n 

a1 

A  = 

a2l  a22  •••  a2n 

•  •  • 

= 

a2 

• 

•  •  • 

•  •  • 

• 

• 

an2 

•  •  •  M 
/m  J 

a" 

^  j 

where 

a  (^i  i>  ^i2>  •  *  •  >  ®\n) 
a  ^22’  *  *  *  ’  &2n) 


a 


n2>  *  * 


Then 


and 


For  example, 


•'N 

a1 

a1 

det 

=  cdet 

• 

• 

• 

a* 

• 

• 

• 

a  n 

i\ 

* 

a1 

• 

a1 

• 

r  > 

a1 

• 

det 

• 

• 

a*+b 

• 

=  det 

• 

• 

a k 

• 

+  det 

• 

• 

b 

• 

• 

• 

a" 

^  j 

• 

• 

a" 

L  J 

• 

• 

a71 

^  J 

det 

[i 

8 

5  7' 
3  7 

<L> 

II 

'1 

4 

5  1 

1  9 

+  det 

1  5  71 

4  2-2 

.6 

-1  oj 

.6 

-i  o. 

.6  -1  0. 

since  (4, 1,9) +  (4,2,  -2)  =  (8,3, 7).  Now  Equation  (i)  is  Property  5.  To  de- 
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rive  Equation  (ii),  we  compute 


det 


a*  +  b 


£:  :  &  1  / 
J 1  •  •  'Jn  1/  1 


=  2 


Ej\—jnaV\'mMHik' 


J  b  *  •  •  »Jn 


dni  +  £,  ;<Zi;  . 

^  J\--Jn  VI 


v* 


.a 


r  'i 

a1 

• 

r  "\ 

a1 

• 

=  det 

• 

• 

a k 

• 

4- det 

• 

• 

b 

• 

• 

• 

a" 

^  j 

• 

• 

a" 

k.  j 

Property  7  now  follows  immediately  from  Equation  (ii),  for  if  B  is  the 
matrix  obtained  from  A  by  adding  a  multiple  c  of  the  A:th  row  of  A  to  the 
yth  row  of  A,  then 


a1 

• 

i”  ^ 

a1 

• 

r  > 

a1 

• 

a1 

• 

det  B  =  det 

• 

• 

a7  4-  cak 

• 

• 

=  det 

• 

• 

a7 

• 

• 

4- det 

• 

• 

c?Lk 

• 

• 

=  det  A  4-  c  det 

a* 

• 

• 

• 

a* 

• 

• 

• 

an 

w  - 

a* 

• 

• 

• 

a" 

a* 

• 

• 

• 

a" 

k.  J 

a* 

• 

• 

• 

a" 

<  > 

But 


a 


k 


det 


a 


k 


0 


since  this  matrix  has  two  equal  rows.  Hence,  det  B  =  det  A. 
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Remark  1.  Everything  we  say  about  rows  applies  to  columns,  since  detAr 
=  det  A.  Thus,  we  do  not  change  the  value  of  the  determinant  when  we  add 
a  multiple  of  one  column  of  A  to  another  column  of  A. 

Remark  2.  The  determinant  is  a  linear  function  of  each  row  of  A  sep¬ 
arately.  It  is  not  a  linear  function  of  A  itself,  since,  in  general, 

det  cAi^c  det  A  and  det(A  4-  B)  ^  det  A  +  det  B. 

For  example,  if 


and 


then 


while  det  A  +  det  B  =  3  -  9  =  —6. 

Property  7  is  extremely  important  because  it  enables  us  to  reduce  the 
problem  of  computing  any  determinant  to  the  much  simpler  problem  of 
computing  the  determinant  of  an  upper  diagonal  matrix.  To  wit,  if 


«u 

a  12 

•  •  • 

A  = 

a2 1 

• 

• 

a22 

• 

• 

•  •  • 

a2  „ 

9 

• 

t - 

• 

«»2 

•  •  • 

• 

a 

nn 

and  an  ^=0,  then  we  can  add  suitable  multiples  of  the  first  row  of  A  to  the 
remaining  rows  of  A  so  as  to  make  the  resulting  values  of  a2V---,cinl  all 
zero.  Similarly,  we  can  add  multiples  of  the  resulting  second  row  of  A  to 
the  rows  beneath  it  so  as  to  make  the  resulting  values  of  a32, . . .  ,an2  all  zero, 
and  so  on.  We  illustrate  this  method  with  the  following  example. 


Example  3.  Compute 


det 


2  3' 

0  2 

1  -1 

3  0. 
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Solution.  Subtracting  twice  the  first  row  from  the  second  row;  four  times 
the  first  row  from  the  third  row;  and  the  first  row  from  the  last  row  gives 


det 


r 

l 

-1 

2 

3 

r 

l 

-1 

2 

3 

2 

2 

0 

2 

=  det 

0 

4 

-4 

-4 

4 

1 

-1 

-1 

0 

5 

-9 

-13 

A 

2 

3 

0, 

.0 

3 

1 

-3. 

=  4  det 


1 

0 

0 


Next,  we  subtract  five  times  the  second  row  of  this  latter  matrix  from  the 
third  row,  and  three  times  the  second  row  from  the  fourth  row.  Then 


f 

1 

-1 

2 

> 

3 

r 

l 

-1 

2 

3 

2 

2 

0 

2 

=  4  det 

0 

1 

-1 

-1 

4 

1 

-1 

-1 

0 

0 

-4 

-8 

.1 

2 

3 

0, 

.0 

0 

4 

0. 

Finally,  adding  the  third  row  of  this  matrix  to  the  fourth  row  gives 


[1 

-1 

2 

3' 

ri 

-1 

2 

31 

2 

2 

0 

2 

=  4  det 

0 

1 

-1 

-1 

4 

1 

-1 

-1 

0 

0 

-4 

-8 

.1 

2 

3 

0. 

.0 

0 

0 

-8J 

=  4(  — 4)(  — 8)=  128. 

(Alternately,  we  could  have  interchanged  the  third  and  fourth  columns  of 
the  matrix 

1  -1  2  3' 

0  1-1-1 

0  0-4-8 

.0  0  4  0. 

to  yield  the  same  result.) 

Remark  1.  If  an  =  0  and  ajX^Q  for  some y,  then  we  can  interchange  the 
first  and  yth  rows  of  A  so  as  to  make  an  ¥=0.  (We  must  remember,  of 
course,  to  multiply  the  determinant  by  - 1.)  If  the  entire  first  column  of  A 
is  zero,  that  is,  if  au  =  a2X  =  ...  =anX  =  0  then  we  need  proceed  no  further 
for  detA  =  0. 
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Remark  2.  In  exactly  the  same  manner  as  we  reduced  the  matrix  A  to  an 
upper  diagonal  matrix,  we  can  reduce  the  system  of  equations 

011*1  +  012*2+  •••  +  <*\nxn  =  bx 
021*1  022*2  •  •  •  +  a2nXn  =  ^2 


0«1*1  0/i2*2  *  •  *  "F  0«/i*« 

to  an  equivalent  system  of  the  form 


=  b 


n 


CnXi  +  Cl2X2+  •••  +c\nxn 
c22x2+  ...+c2nx„ 


4 

d2 


C  X 
*■71/1  "*71 


d  . 


We  can  then  solve  (if  cnn  =^0)  for  from  the  last  equation,  for  from 

the  ( n  —  l)st  equation,  and  so  on. 

Example  4.  Find  all  solutions  of  the  system  of  equations 

*i  +  *2  +  *3=  1 

-x1  +  x2  +  *3  =  2 

2xx  —  x2  +  x3  =  3. 

Solution .  Adding  the  first  equation  to  the  second  equation  and  subtracting 
twice  the  first  equation  from  the  third  equation  gives 

*i  +  x2  +  *3  =  1 

2x2  +  2x3  =  3 
—  3x2  —  x3=  1. 

Next,  adding  \  the  second  equation  to  the  third  equation  gives 

xx  +  x2  +  x3=\ 

2x2  +  2x3  =  3 

2*3  =  y. 

Consequently,  x3  =  y,  x2  =  (3-y)/2=  -f,  and  x,  =  l  +  f-y=-y 


Exercises 

1.  Show  that  the  system  of  equations 

<*11*1  +  <*12*2  +  <*13*3  =  &1 
<*21*1  +  <*22*2  +  <*23*3  =  ^2 
<*31*1  +  <*32*2  +  <*33*3  =  *3 
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has  a  unique  solution  xx,x2,x3  if,  and  only  if, 


det 


a\\ 

aX2 

«13\ 

a2\ 

a22 

a23  It^O. 

al\ 

a32 

«33  / 

Hint :  Solve  for  xx  in  terms  of  x2  and  x3  from  one  of  these  equations. 

2.  (a)  Show  that  the  total  number  of  permutations  of  the  integers  1,2,...,/!  is 
even. 

(b)  Prove  that  exactly  half  of  these  permutations  are  even,  and  half  are  odd. 


In  each  of  Problems  3-8  compute  the  determinant  of  the  given  matrix. 


/I  0  2 

3.  1  2  5 

\6  8  0 


t  r 

t2  1 
/  1  - 


0  a  b  O' 

—  a  0  c  0 

-b  -c  0  0 

l  0  0  0  lj 


7. 


8. 


u 

fl 

1 

o 

o 

0 


-1  6  3 

0  1  -1 
3  0  2 

-1  1  Oj 

1  1  1  1 
0  0  0  2 
10  0  3 

0  10  4 

0  0  15 


9.  Without  doing  any  computations,  show  that 


det 

a  b 
d  e 

> 

c 

f 

=  det 

e  b 
d  a 

h 

g 

g  h 

K. 

k 

J 

J  c 

k 

J 

In  each  of  Problems  10-15,  find  all 
tions 

10.  xx  +  x2— x3=0 
2xx  +  x3=  14 

x2  +  x3=  13 

12.  xx  +  x2+x3=0 
xx  —  x2  —  x3  =  0 
x2  +  x3  =  0 

14.  xx+x2  +  2x3  —  x4=1 
xx—x2  +  2x3+x4  =  2 
x\  +x2  +  2x3  —  x4=  1 
—  xx  —x2  —  2x3+x4  =  0 


solutions  of  the  given  system  of  equa- 

11.  x\  +x2  +  x3  =  6 
x1-x2-x3=-4 

x2  +  x3~  “  1 

13.  *!+  x2+  x3  —  x4=l 
xx  -\-2x2  —  2x3+x4=  1 
xx  +  3x2  —  3x3  —  x4=  1 
xi +  ^x2  ~  4x3  —  x4  =  1 

15.  jcj—  x2+x3+  x4  =  0 
xx  +2x2—x3  +  3x4  =  0 
3xx  +  3x2 — x3  +  lx4  =  0 
—  xx  +  2x2+x3—  x4=0 
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3.6  Solutions  of  simultaneous  linear  equations 

In  this  section  we  will  prove  that  the  system  of  equations 


'an 

...  a^n 

■*r 

V 

Ax  =  b,  A  = 

• 

• 

• 

an\ 

m 

• 

• 

. . .  ann 

,  x  = 

• 

• 

• 

Xn 

<  J 

V* 

II 

1:1 

has  a  unique  solution  x  if  detA^O.  To  this  end,  we  define  the  product  of 
two  n  X  n  matrices  and  then  derive  some  additional  properties  of  determi¬ 
nants. 


Definition.  Let  A  and  B  be  n  X  n  matrices  with  elements  a j  and  btJ  respec¬ 
tively.  We  define  their  product  AB  as  the  nXn  matrix  C  whose  ij  ele¬ 
ment  Cy  is  the  product  of  the  /th  row  of  A  with  the  yth  column  of  B. 
That  is  to  say, 


2  aikbkf 


k  =  1 


Alternately,  if  we  write  B  in  the  form  B  =  (b\b2,  ...,b"), where  b-'is  the 
jth.  column  of  B,  then  we  can  express  the  product  C=AB  in  the  form 
C  =  (Ab1, Ab2, ...,Abw),  since  the  /th  component  of  the  vector  Aby  is 

2  k—  \  aikPkj  ' 


Example  1.  Let 


and 


1 

2  1 

1  1 

j 


Compute  AB. 
Solution. 


1  O' 
1  1 
0  Oj 


'3 

1 

-1 

'  1 

-1 

O' 

0 

2 

1 

2 

-1 

1 

.1 

1 

1 

-1 

0 

0 

J 

'3+2+1 

— 

-3-1+0 

0+1+0' 

0+4-1 

0-2  +  0 

0+2+0 

.1+2-1 

— 

-1-1+0 

0+1+0 

'6  -4 

11 

3  -2 

2 

2  -2 

1 
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Example  2.  Let  A  and  B  be  the  matrices  in  Example  1.  Compute  BA. 
Solution. 

'  1  -1  0]  f  3  1  -11  f  3+0+0  1-2+0  -1-1+0' 

2-1102  1  =  6+0+1  2-2+1  -2-1+1 
-1  0011  1  -3+0+0  -1+0+0  1+0+0 

v  J  V-  J  V.  J 

'  3  -1  -2' 

=  7  1-2 

[-3  -1  lj 

Remark  1.  As  Examples  1  and  2  indicate,  it  is  generally  not  true  that  AB= 
BA.  It  can  be  shown,  though,  that 

A(BC)  =  (AB)C  (2) 

for  any  three  nXn  matrices  A,  B,  and  C.  We  will  give  an  extremely  simple 
proof  of  (2)  in  the  next  section. 

Remark  2.  Let  I  denote  the  diagonal  matrix 

1  0  ...O' 

01  ...  0 

•  •  *  • 

•  •  • 

•  •  • 

00  ...  1 

J 

I  is  called  the  identity  matrix  since  IA  =  AI  =  A  (see  Exercise  5)  for  any 
n  X  n  matrix  A. 

The  following  two  properties  of  determinants  are  extremely  useful  in 
many  applications. 

Property  8. 

det  AB  =  det  A  X  det  B. 

That  is  to  say,  the  determinant  of  the  product  is  the  product  of  the  de¬ 
terminants. 

Property  9.  Let  A(/|y)  denote  the  (n  —  l)X(n  —  1)  matrix  obtained  from  A 
by  deleting  the  /th  row  and  j th  column  of  A.  For  example,  if 

12  6 
A=  0  -1  -3 

,-4  -5  0 

Let  ciJ  =  (—  l),+ydetA(/|y).  Then 

n 

detA=  2  cijjCjj 

i  =  i 
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for  any  choice  of  j  between  1  and  n.  This  process  of  computing  determi¬ 
nants  is  known  as  “expansion  by  the  elements  of  columns,”  and  Prop¬ 
erty  9  states  that  it  does  not  matter  which  column  we  choose  to  expand 
about.  For  example,  let 

1  3  2  6' 

A=  9-107 

2  1  3  4' 

.-1  6  3  5. 

Expanding  about  the  first,  second,  third,  and  fourth  columns  of  A,  re¬ 
spectively,  gives 

-1  0  71  [3  2  6' 

detA=det  1  3  4  -9det  1  3  4 

.  6  3  5J  [6  3  5, 

'  3  2  61  [  3  2  6' 

+  2det  -1  0  7  +det  -1  0  7 

.  6  3  5J  1  3  4. 

'  9  0  71  [12  6' 

=  - 3 det  2  3  4  -det  2  3  4 

,-l  3  5  J  [-1  3  5. 

'  1  2  61  [12  6' 

-det  9  0  7  +6 det  9  0  7 
.-1  3  5j  [2  3  4. 

'  9  -1  71  [1  3  6' 

=  2  det  2  1  4  +3  det  9  -1  7 

.-1  6  5j  [-1  6  5. 

1  3  6 

-3det  9  -l  7 

.2  1  4, 

'  9  -1  01  [13  2' 

=  -6det  2  1  3  +7  det  2  1  3 

.-1  6  3j  [-1  6  3. 

[1  3  21  [1  32' 

-4  det  9  -10  +5  det  9  -1  0  . 

.-1  6  3j  [2  1  3. 

We  will  derive  Properties  8  and  9  with  the  aid  of  the  following  lemma. 

Lemma  1.  Let  D  =  D  (A)  be  a  function  that  assigns  to  each  n  X  n  matrix  A 
a  number  D  (A).  Suppose,  moreover,  that  D  is  a  linear  function  of  each 
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column  (row)  of  A  separately,  i.e. 

JD(a1,...,ay  +  cby,...,a")  =  Z)(a1,...,a7,...,a,l)  +  cZ)  (a1,...,lK...,a'1), 

and  D  (B)  =  —  D  (A)  if  B  is  obtained  from  A  by  interchanging  two  col¬ 
umns  (rows)  of  A.  Then 

D  (A)  =  det  A  X  D  (I). 


A  function  D  that  assigns  to  each  nXn  matrix  a  number  is  called  alter¬ 
nating  if  D  (B)  =  —  D  (A)  whenever  B  is  obtained  from  A  by  interchanging 
two  columns  (rows)  of  A.  Lemma  1  shows  that  the  properties  of  being 
alternating  and  linear  in  the  columns  (rows)  of  A  serve  almost  completely 
to  characterize  the  determinant  function  det  A.  More  precisely,  any  func¬ 
tion  D  (A)  which  is  alternating  and  linear  in  the  columns  (rows)  of  A  must 
be  a  constant  multiple  of  det  A.  If,  in  addition,  Z)(I)=  1,  then  Z)(A)  =  detA 
for  all  n  X  n  matrices  A.  It  also  follows  immediately  from  Lemma  1  that  if 
D  (A)  is  alternating  and  linear  in  the  columns  of  A,  then  it  is  also  alternat¬ 
ing  and  linear  in  the  rows  of  A. 

Proof  of  lemma  1.  We  first  write  A  in  the  form  A  =  (a\ a2,..., a")  where 


On' 

r - 

<N 

o' 

^ _ 

O,  n 

a2\ 

a22 

flW  = 

o2„ 

• 

• 

• 

an\  _ 

,  a2  = 

CN 

...  s: 

_ J 

• 

• 

Then,  writing  a1  in  the  form  a1  =  ax  ,e]  +  . . .  +  anXen  we  see  that 

D(A)  =  Z)(«11e1+  ...  4-<3/lle",a2, ...,an) 

=  a11Z)(e1,a2,...,a'I)+ ...  +  <zwlZ)(e't,a2,...,a'1) 

=  lLa\jxD  (e-/,,a2,...,aw). 
j\ 

Similarly,  writing  a2  in  the  form  a2  =  a12e’  +  . . .  4-  an2en  we  see  that 

D (A)  =  2  alja2Jp(eJ',eJ*> a3,..., a"). 

j\Jl 

Proceeding  inductively,  we  see  that 

£(A)=  2  a\jpJl ■  ■  •  anjP (e*. e72> •••. e7"). 

■/!>•••  >Jn 


Now,  we  need  only  sum  over  those  integers  jxJ2,  •  •  •  Jn  with  ji^Jk  since 
D  (A)  is  zero  if  A  has  two  equal  columns.  Moreover, 


D  (e*.  ej\  ...,ep  =  tjUi..  (e\  e2, . . . ,  e”)  =  e,. . .  ,jD  (I). 
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Consequently, 

0(A)-  2  ej,  -  j„aijl  ■  ■  ■  a»j„D  (I)  =  det  A  X  D  (I).  □ 

J 1 » •  •  ’  >Jn 

We  are  now  in  a  position  to  derive  Properties  8  and  9. 

Proof  of  property  8.  Let  A  be  a  fixed  nXn  matrix,  and  define  the  func¬ 
tion  D  (B)  by  the  formula 

D(  B)  =  detAB. 

Observe  that  Z)(B)  is  alternating  and  linear  in  the  columns  b\...,b/l  of  B. 
This  follows  immediately  from  the  fact  that  the  columns  of  AB  are 
Ab^.-jAb".  Hence,  by  Lemma  1. 

D  (B)  =  det  B  X  D  (I)  =  det  B  X  det  AI  =  det  A  X  det  B.  □ 

Proof  of  property  9.  Pick  any  integer  j  between  1  and  n  and  let 

D{ A)  =  2  auc>p 
1=1 

where  ciJ  =  (—  l),+7detA(/|y).  It  is  trivial  to  verify  that  D  is  alternating  and 
linear  in  the  columns  of  A.  Hence,  by  Lemma  1, 

D  (A)  =  det  A  X  D  (I)  =  det  A.  □ 

The  key  to  solving  the  system  of  equations  (1)  is  the  crucial  observation 
that 

2  aikcij=°  fc>r  k*j  (3) 

Z=  1 

where  c^  =  (—  l)/"l'ydetA(/|y).  The  proof  of  (3)  is  very  simple:  Let  B  denote 
the  matrix  obtained  from  A  by  replacing  the yth  column  of  A  by  its  A:th  col¬ 
umn,  leaving  everything  else  unchanged.  For  example,  if 

3  5  6 

A=  4  2  1  ,  y  =  2,  and  k  =  3, 

-10-1 

V  J 

then 

'3  6  6' 

B=  4  1  1  . 

-1  -1  -1 

V  J 

Now,  the  determinant  of  B  is  zero,  since  B  has  two  equal  columns.  On  the 
other  hand,  expanding  about  the  yth  column  of  B  gives 

det B  =  2  bucu~  2 

i = 1  1=1 
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where 


4  =  ( -  l)'+7detB(/|y)  =  ( -  l)'+7det  A(/|y)  =  cir 

Hence,  2"-i  Wy  =  0  if  A:  is  unequal  to  j. 

Now,  whenever  we  see  a  sum  from  1  to  n  involving  the  product  of  terms 
with  two  fixed  indices  j  and  k  (as  in  (3)),  we  try  and  write  it  as  the  jk  ele¬ 
ment  of  the  product  of  two  matrices.  If  we  let  C  denote  the  matrix  whose  ij 
element  is  c tj  and  set 

adj  A=Cr, 

then 

n  n 

^ik^ij  A) ji  &ik  ^  ^  jk * 

/=  1  /=1 

Hence,  from  (3), 


(adj  A  X  A )Jk  =  0  for  j  =£  k. 
Combining  this  result  with  the  identity 


n 


detA=  2  aijcij  =  (adj  A  X  A) M, 
/=  1 


we  see  that 


adj  A  X  A  = 


detA  0 
0  detA 


0 

0 


=  (detA)I. 


(4) 


0  0  ...  detA 

Similarly,  by  working  with  Ar  instead  of  A  (see  Exercise  8)  we  see  that 


A  X  adj  A  = 


detA 

0 


0 

detA 


0 

0 


=  (detA)I. 


0 


detAj 


Example  3.  Let 


A  = 


1  0 
1  1 
1  2 


-1 


Compute  adj  A  and  verify  directly  the  identities  (4)  and  (5). 
Solution. 


adj  A  =  Cr = 

3 

-2 

-2  r 
2  -2 

1 

3 

-2 

-2  r 
2  0 

.  1 

0  lj 

.  1 

-2  1. 
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so  that 

'  3  -2  llfl  0  -1]  (2  0  O' 

adj  A  X  A  =  —  2  2  0  1  1  -1  =  0  2  0  =21 

1-2112  1  002 

and 

1  0  -Ilf  3  -2  ll  (2  0  O' 

A  X  adj  A  =  11  -1  -2  20  =  020  =21. 

J  2  1 J  L  1  -2  lj  [0  0  2. 

Now,  det A=  1-2+14-2  =  2.  Hence, 

adj  A  X  A  =  A  X  adj  A  =  (det  A)I. 

We  return  now  to  the  system  of  equations 

all  &ln 

Ax  =  b,  A  =  ,  x 

dft  1  *  *  *  ^nn 

V  J 

If  A  were  a  nonzero  number  instead  of  a  matrix,  we  would  divide  both 
sides  of  (6)  by  A  to  obtain  that  x  =  b/A.  This  expression,  of  course,  does 
not  make  sense  if  A  is  a  matrix.  However,  there  is  a  way  of  deriving  the 
solution  x  =  b/A  which  does  generalize  to  the  case  where  A  is  an  nXn 
matrix.  To  wit,  if  the  number  A  were  unequal  to  zero,  then  we  can  multiply 
both  sides  of  (6)  by  the  number  A-1  to  obtain  that 

A_1Ax  =  x  =  A_1b. 

Now,  if  we  can  define  A-1  as  an  n  X  n  matrix  when  A  is  an  n  X  n  matrix, 
then  the  expression  A-1b  would  make  perfectly  good  sense.  This  leads  us 
to  ask  the  following  two  questions. 

Question  1 :  Given  an  n  X  n  matrix  A,  does  there  exist  another  n  X  n  matrix, 
which  we  will  call  A”1,  with  the  property  that 

A_1A  =  AA-1  =1? 

Question  2:  If  A-1  exists,  is  it  unique?  That  is  to  say,  can  there  exist  two 
distinct  matrices  B  and  C  with  the  property  that 

BA  =  AB  =  I  and  CA  =  AC  =  I? 

The  answers  to  these  two  questions  are  supplied  in  Theorems  7  and  8. 

Theorem  7.  An  nXn  matrix  A  has  at  most  one  inverse . 

Proof.  Suppose  that  A  has  two  distinct  inverses.  Then,  there  exist  two  dis¬ 
tinct  matrices  B  and  C  with  the  property  that 

AB  =  BA  =  I  and  AC  =  CA  =  I. 
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If  we  multiply  both  sides  of  the  equation  AC  =  I  by  B  we  obtain  that 

BI  =  B(AC)  =  (BA)C  =  IC. 

Hence,  B  =  C,  which  is  a  contradiction.  □ 


Theorem  8.  A  1  exists  ij ,  and  only  if,  detA^O,  and  in  this  case 

A  —  1  _  1 


detA 


adj  A. 


Proof.  Suppose  that  detA^O.  Then,  we  can  divide  both  sides  of  the  iden¬ 
tities  (4)  and  (5)  by  detA  to  obtain  that 


adj  A 
detA 


X  A  =  I  = 


1 

detA 


A  X  adj  A  =  A  X 


adj  A 
detA  * 


Hence,  A-1  =  adj  A/detA. 

Conversely,  suppose  that  A-1  exists.  Taking  determinants  of  both  sides 
of  the  equation  A_1A  =  I,  and  using  Property  8  gives 


(detA  1)detA  =  detI=  1. 

But  this  equation  implies  that  detA  cannot  equal  zero. 


□ 


Finally,  suppose  that  detA^O.  Then,  A  1  exists,  and  multiplying  both 
sides  of  (6)  by  this  matrix  gives 

A  ' 1  Ax  =  lx  =  x  =  A  “  lb. 

Hence,  if  a  solution  exists,  it  must  be  A-1b.  Moreover,  this  vector  is  a  solu¬ 
tion  of  (6)  since 

A(A1b)  =  AA1b  =  Ib  =  b. 

Thus,  Equation  (6)  has  a  unique  solution  x  =  A“1b  if  detA^O. 


Example  4.  Find  all  solutions  of  the  equation 


1  1  f 

[o' 

'•V 

2-2  2 

X  = 

0 

,  X  = 

*2 

.3  3  -3, 

0 

^  J 

*3 

^  J 

Solution. 


/" 

i 

1 

** 

1 

'i 

1 

11 

det 

2 

-2 

2 

<D 

II 

0 

-4 

0 

.3 

3 

-3, 

.0 

0 

-6. 

Hence,  Equation  (8)  has  a  unique  solution  x.  But 


x  = 


0 
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is  obviously  one  solution.  Therefore, 

fo] 

x=  0 

.0. 

is  the  unique  solution  of  (8). 

Remark.  It  is  often  quite  cumbersome  and  time  consuming  to  compute  the 
inverse  of  an  n  X  n  matrix  A  from  (7).  This  is  especially  true  for  n  >  4.  An 
alternate,  and  much  more  efficient  way  of  computing  A-1,  is  by  means  of 
“elementary  row  operations.” 

Definition.  An  elementary  row  operation  on  a  matrix  A  is  either 

(i)  an  interchange  of  two  rows, 

(ii)  the  multiplication  of  one  row  by  a  nonzero  number, 
or 

(iii)  the  addition  of  a  multiple  of  one  row  to  another  row. 

It  can  be  shown  that  every  matrix  A,  with  det  At^O,  can  be  transformed 
into  the  identity  I  by  a  systematic  sequence  of  these  operations.  Moreover, 
if  the  same  sequence  of  operations  is  then  performed  upon  I,  it  is  trans¬ 
formed  into  A-1.  We  illustrate  this  method  with  the  following  example. 

Example  5.  Find  the  inverse  of  the  matrix 

1  0  -r 

A=  1  1  -1  . 

U  2  lj 

Solution.  The  matrix  A  can  be  transformed  into  I  by  the  following 
sequence  of  elementary  row  operations.  The  result  of  each  step  appears  be¬ 
low  the  operation  performed. 

(a)  We  obtain  zeros  in  the  off-diagonal  positions  in  the  first  column  by 
subtracting  the  first  row  from  both  the  second  and  third  rows. 

1  0  -T 

0  1  0 

.0  2  2^ 

(b)  We  obtain  zeros  in  the  off-diagonal  positions  in  the  second  column  by 
adding  ( —  2)  times  the  second  row  to  the  third  row. 

\  0  -r 

0  1  0 

.0  0  2^ 
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(c)  We  obtain  a  one  in  the  diagonal  position  in  the  third  column  by  multi¬ 
plying  the  third  row  by  |. 

1  0  -r 

0  1  0 

10  0  lj 

(d)  Finally,  we  obtain  zeros  in  the  off-diagonal  positions  in  the  third  col¬ 
umn  by  adding  the  third  row  to  the  first  row. 

1  0  O' 

0  1  0 

.0  0  1. 

If  we  perform  the  same  sequence  of  elementary  row  operations  upon  I,  we 
obtain  the  following  sequence  of  matrices: 

1  0  0]  f  1  0  0]  f  1  0  O' 

010,-110,-1  10, 

.0  0  lj  - 1  0  lj  [  1  -2  lj 

0  0]  [  |  -i  i 

1  o  ,  !  o 

1  ij  |  -1  i 

The  last  of  these  matrices  is  A-1. 


Exercises 

In  each  of  Problems  1-4,  compute  AB  and  BA  for  the  given  matrices  A 
and  B. 

3  2  0\  /-I  0 

1.  A*  1  1  1  ,  B=  I  2  1 

\2  6  0/  \  0  6 

/ 1  0  1  \  /Oil 

3.  A=  1  l  0  ,  B=  I  1  1  0 

\0  1  1/  V 1  0  1 

1111]  f  1  1  1  1 
1111  2  2  2  2 

1111’  3  3  3  3 

1  1  1  1 J  [4444 

5.  Show  that  IA  =  AI  =  A  for  all  matrices  A. 

6.  Show  that  any  two  diagonal  matrices  A  and  B  commute ,  that  is  AB  =  BA,  if  A 
and  B  are  diagonal  matrices. 
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7.  Suppose  that  AD  =  DA  for  all  matrices  A.  Prove  that  D  is  a  multiple  of  the 
identity  matrix. 

8.  Prove  that  A  X  adj  A  =  det  A  X I. 


In  each  of  Problems  9-14,  find  the  inverse,  if  it  exists,  of  the  given  matrix. 


9. 


13. 


15.  Let 


3 

0 

1 


1  1 


1 

1  +  i 
0 
1 


10. 


0  —  sin# 

1  0 
0  cos# 


12. 


1 

l-/\ 
0 
1 


1 

0 

-1 


A=(*n 

\a21 


*12  \ 
*22/ 


Show  that 


A"1- 


1 


det  A 


/  ^22  012  \ 

\  —  a2\  a\i  ) 


if  det  A  7^0. 

16.  Show  that  (AB)_1  =  B~1A“1  if  detAxdetB^O. 

In  each  of  Problems  17-20  show  that  x  =  0  is  the  unique  solution  of  the 
given  system  of  equations. 


17.  x\—  x2—  *3  =  0 
3xx  —  x2  +  2x3  =  0 
2x\  +  2x2  +  3x3  =  0 

19.  xx+  2x2—  x3  =0 
2xy+  3x2  +  x3—  x4  =  0 
—  xx  +2x3 +  2*4=0 
3xx—  *2  +  x3  +  3*4  =  0 


18.  *!  + 2*2 +4*3  =  0 

*2+  *3  =  0 
*1+  *2+  *3  =  0 

20.  *!+  2*2—  *3 +  3*4  =  0 

2*!+  3*2  —  *4  =  0 

—  *!+  *2  +  2*3+  *4  =  0 

—  *2  +  2*3 +  3*4  =  0 


3.7  Linear  transformations 

In  the  previous  section  we  approached  the  problem  of  solving  the  equation 


r  „ 

*11 

f - 

s: 

• 

• 

• 

'*r 

r 

V. _ 

Ax  =  b,  A  = 

> . . . 

• 

• 

• 

an„ 

j 

>  x  = 

.V 

II 

by  asking  whether  the  matrix  A  1  exists.  This  approach  led  us  to  the  con¬ 
clusion  that  Equation  (1)  has  a  unique  solution  x  =  A_1b  if  detA^O.  In 
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order  to  determine  what  happens  when  detA  =  0,  we  will  approach  the 
problem  of  solving  (1)  in  an  entirely  different  manner.  To  wit,  we  will  look 
at  the  set  V  of  all  vectors  obtained  by  multiplying  every  vector  in  R"  by  A 
and  see  if  b  is  in  this  set.  Obviously,  Equation  (1)  has  at  least  one  solution 
x  if,  and  only  if,  b  is  in  V.  We  begin  with  the  following  simple  but  ex¬ 
tremely  useful  lemma. 

Lemma  1.  Let  A  be  an  nXn  matrix  with  elements  a ij9  and  let  x  be  a  vector 
with  components  xux2, Let 

a\j 

a7=  : 

• 

a 

nj 

denote  the  yth  column  of  A.  Then 

Ax  =  xxd}  4-  x2a2  +  . . .  4-  xnan. 

Proof.  We  will  show  that  the  vectors  Ax  and  xxa]  +  ...  +xn*n  have  the 
same  components.  To  this  end,  observe  that  (Ax);,  the  yth  component  of 
Ax,  is  ajxxx  4- ...  4-  ajnxn ,  while  the yth  component  of  the  vector  jCja1  +  . . .  + 
xn^T  is 

xxaj  4- ...  4-  xn&j  =  xxajx  4-  . . .  4-  xnaJn  —  (Ax)y. 

Hence,  Ax  =  xxal  4-  x2a2  4- ...  4-  xnan.  □ 

Now,  let  V  be  the  set  of  vectors  obtained  by  multiplying  every  vector  x 
in  R"  by  the  matrix  A.  It  follows  immediately  from  Lemma  1  that  V  is  the 
set  of  all  linear  combinations  of  the  vectors  a1, a2,..., a",  that  is,  V  is 
spanned  by  the  columns  of  A.  Hence  the  equation  Ax  =  b  has  a  solution  if, 
and  only  if,  b  is  a  linear  combination  of  the  columns  of  A.  With  the  aid  of 
this  observation,  we  can  now  prove  the  following  theorem. 

Theorem  9.  (a)  The  equation  Ax  =  b  has  a  unique  solution  if  the  columns  of 
A  are  linearly  independent. 

(b)  The  equation  Ax  =  b  has  either  no  solution ,  or  infinitely  many  solu¬ 
tions ,  if  the  columns  of  A  are  linearly  dependent. 

Proof,  (a)  Suppose  that  the  columns  of  A  are  linearly  independent.  Then, 
a1, a2,..., a"  form  a  basis  for  R".  In  particular,  every  vector  b  can  be  written 
as  a  linear  combination  of  a1, a2,..., a".  Consequently,  Equation  (1)  has  at 
least  one  solution.  To  prove  that  (1)  has  exactly  one  solution,  we  show  that 
any  two  solutions 
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and 

'^r 

y=  : 

must  be  equal.  To  this  end,  observe  that  if  x  and  y  are  two  solutions  of  (1), 
then 

A(x  —  y)  =  Ax  —  Ay  =  b  —  b=0. 

By  Lemma  1,  therefore, 

(x1->'1)a,  +  (x2-,y2)a2  +  ... +(x„-y„)ar =0.  (2) 

But  this  implies  that  *,=.)>,,  x2=y2,...,  and  xn=y„,  since  a1, a2 . a"  are 

linearly  independent.  Consequently,  x  =  y. 

(b)  If  the  columns  of  A  are  linearly  dependent,  then  we  can  extract  from 
a1,  a2, . . . ,  a"  a  smaller  set  of  independent  vectors  which  also  span  V  (see  Ex¬ 
ercise  12,  Section  3.3).  Consequently,  the  space  V  has  dimension  at  most 
n—  1.  In  other  words,  the  space  V  is  distinctly  smaller  than  R".  Hence, 
there  are  vectors  in  R"  which  are  not  in  V.  If  b  is  one  of  these  vectors,  then 
the  equation  Ax  =  b  obviously  has  no  solutions.  On  the  other  hand,  if  b  is 
in  V,  that  is,  there  exists  at  least  one  vector  x*  such  that  Ax*=b,  then 
Equation  (1)  has  infinitely  many  solutions.  To  prove  this,  observe  first  that 
x  =  x*  is  certainly  one  solution  of  (1).  Second,  observe  that  if  A£=0  for 
some  vector  £  in  R",  then  x  =  x*+£  is  also  a  solution  of  (1),  since 

A(x*+£)  =  Ax*  +  A£=b+0  =  b. 

Finally,  observe  that  there  exist  numbers  c1,c2,...,c/I  not  all  zero,  such  that 
Cia1  -h c2a2 -I- ...  +cnan  =  0.  By  Lemma  1,  therefore,  A£=0,  where 

hi 


But  if  A£  equals  zero,  then  A(a£)  also  equals  zero,  for  any  constant  a. 
Thus,  there  are  infinitely  many  vectors  £  with  the  property  that  A£=0. 
Consequently,  the  equation  Ax  =  b  has  infinitely  many  solutions.  □ 

Example  1.  (a)  For  which  vectors 

lbl 

b=  b2 
b3 

v.  J 

can  we  solve  the  equation 

fl  12' 

Ax=  1  0  1  x=b? 

2  1  3. 
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(b)  Find  all  solutions  of  the  equation 


'1 

1 

2 

[0] 

1 

0 

1 

x  = 

0 

[2 

1 

3. 

.0. 

(c)  Find  all  solutions  of  the  equation 


1 

1 

2 

t 

1 

0 

1 

x  = 

0 

.2 

1 

3. 

Solution,  (a)  The  columns  of  the  matrix  A  are 


a1  = 

T 

l 

to 

II 

0 

and  a3  = 

2 

1 

2. 

.1, 

.3, 

Notice  that  a1  and  a2  are  linearly  independent  while  a3  is  the  sum  of  a1  and 
a2.  Hence,  we  can  solve  the  equation  Ax  =  b  if,  and  only  if, 


r 

rn 

'  ct  +  c2' 

b=  c, 

i 

L2J 

+  C2 

0 

.iJ 

^1 

2c,  +  c2 

for  some  constants  cx  and  c2.  Equivalently,  (see  Exercise  25)  b3  =  bx  +  b2 
(b)  Consider  the  three  equations 

xx  +  ;t2  +  2x3  =  0 
x\  +  x3  =  0 
2xx  +  x2  +  3x3  =  0. 


Notice  that  the  third  equation  is  the  sum  of  the  first  two  equations.  Hence, 
we  need  only  consider  the  first  two  equations.  The  second  equation  says 
that  xx  =  —  x3.  Substituting  this  value  of  xx  into  the  first  equation  gives 
x2  =  —xy  Hence,  all  solutions  of  the  equation  Ax  =  0  are  of  the  form 


(c)  Observe  first  that 


x 


1 


is  clearly  one  solution  of  this  equation.  Next,  let  x2  be  any  other  solution  of 
this  equation.  It  follows  immediately  that  x2  =  x*+£,  where  £  is  a  solution 
of  the  homogeneous  equation  Ax  =  0.  Moreover,  the  sum  of  any  solution  of 
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the  nonhomogeneous  equation 


with  a  solution  of  the  homogeneous  equation  is  again  a  solution  of  the 
nonhomogeneous  equation.  Hence,  any  solution  of  the  equation 


is  of  the  form 


Ax  = 


0 


[01 

-r 

x  = 

l 

+  C 

- 1 

.0. 

.  i. 

Theorem  9  is  an  extremely  useful  theorem  since  it  establishes  necessary 
and  sufficient  conditions  for  the  equation  Ax  =  b  to  have  a  unique  solution. 
However,  it  is  often  very  difficult  to  apply  Theorem  9  since  it  is  usually 
quite  difficult  to  determine  whether  n  vectors  are  linearly  dependent  or  lin¬ 
early  independent.  Fortunately,  we  can  relate  the  question  of  whether  the 
columns  of  A  are  linearly  dependent  or  linearly  independent  to  the  much 
simpler  problem  of  determining  whether  the  determinant  of  A  is  zero  or 
nonzero.  There  are  several  different  ways  of  accomplishing  this.  In  this  sec¬ 
tion,  we  will  present  a  very  elegant  method  which  utilizes  the  important 
concept  of  a  linear  transformation. 


Definition.  A  linear  transformation  &  taking  R"  into  R"  is  a  function 
which  assigns  to  each  vector  x  in  R"  a  new  vector  which  we  call  6E(x)  = 
&(xh...,xn).  Moreover,  this  association  obeys  the  following  rules. 

6E(cx)  =  c($(x)  (i) 

and 

<$(x  +  y)  =  6B(x)  +  &  (y).  (ii) 


Example  2.  The  transformation 

'xi 
x  = 

u 

is  obviously  a  linear  transformation  of  Rw  into  R"  since 

&(cx)  =  cx  =  c&(x) 

and 

&(x  +  y)  =  x  +  y  =  6E(x)  +  6E(y). 


324 


3.7  Linear  transformations 


Example  3.  The  transformation 


■*r 

Xi  +  X2  +  X3- 

x  = 

X2 

^<£(x)  = 

XX  +  x2  —  x3 

*3 

k.  J 

.  xt  . 

is  a  linear  transformation  taking  R3  into  R3  since 


and 


II 

'  cxx  +  cx2  +  cx3 
cxx  +  cx2  —  cx3 

=  c 

r  xx  +  x2  +  x3' 
XX  +  X2  —  x3 

=  c&(x) 

cxx 

xx 

K.  J 

®(x + y) 


(*i  +y  1) + O2  +y2) + (*3  +y2) 

(*i  +yj  +  (x2  +y2 )  -  (*3  +y3) 
xt+y{ 


'  xx  +  x2  +  x3' 

yi+yi+ys' 

Xx  +  X2  —  X3 

+ 

y  i  ^~y2  y$ 

xx 

K.  J 

=  &(x)  +  &(y). 


Example  4.  Let  ®(x,,x2)  be  the  point  obtained  from  x  =  (x{,x2)  by  rotating 
x  30°  in  a  counterclockwise  direction.  It  is  intuitively  obvious  that  any 
rotation  is  a  linear  transformation.  If  the  reader  is  not  convinced  of  this, 
though,  he  can  compute 

V3  xx-\x2 
xx  +  \  V3  x2 

and  then  verify  directly  that  &  is  a  linear  transformation  taking  R2  into  R2. 
Example  5.  The  transformation 


takes  R2  into  R2  but  is  not  linear,  since 

£(2x)  =  @,(2x„2x2)  =  I  4x2+4x2  j  ¥^2&(x). 


&(xux2)  = 


]_ 

2 

_[ 

2 


Now,  every  n  X  n  matrix  A  defines,  in  a  very  natural  manner,  a  linear 
transformation  6£  taking  R"— >R".  To  wit,  consider  the  transformation  of 
Rrt^R/*  defined  by 

x-»6E(x)=Ax. 

In  Section  3.1,  we  showed  that  A(cx)  =  cAx  and  A(x  +  y)=Ax  +  Ay.  Hence, 
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the  association  x— »6E(x)  =  Ax  is  linear.  Conversely,  any  linear  transforma¬ 
tion  &  taking  R'l-»Rn  must  be  of  the  form  (£(x)  =  Ax  for  some  matrix  A. 
This  is  the  content  of  the  following  theorem. 


Theorem  10.  Any  linear  transformation  x^6B(x)  taking  R"  into  R"  must  be 
of  the  form  (£(x)  =  Ax.  In  other  words ,  given  any  linear  transformation  & 
taking  R"  into  R",  we  can  find  an  nXn  matrix  A  such  that 

(E(x)  =  Ax 

for  all  x. 

Proof.  Let  eJ  denote  the  vector  whose  jth  component  is  one,  and  whose 
remaining  components  are  zero,  and  let 

aJ  =  &(ej). 

We  claim  that 

6E(x)  =  Ax,  A  =  (a1,a2,...,a”)  (3) 

for  every 


x  = 


in  R".  To  prove  this,  observe  that  any  vector  x  can  be  written  in  the  form 
x  =  xle1  +  ...  +  xnen.  Hence,  by  the  linearity  of  6E, 

®(x)  =  +  . . .  +  xnen)  —  x^e1)  +  . . .  +  xn&(en ) 


=  x,a1+  ...  +  x^a"  =  Ax. 


Remark  1.  The  simplest  way  of  evaluating  a  linear  transformation  &  is  to 
compute  a1  =  (£(eI),...,a'*==($(e'7)  and  then  observe  from  Theorem  10  and 
Lemma  1  that  6B(x)  =  Ax,  where  A  =  (a!,a2, ...,a").  Thus,  to  evaluate  the 
linear  transformation  in  Example  4  above,  we  observe  that  under  a  rota¬ 
tion  of  30°  in  a  counterclockwise  direction,  the  point  (1,0)  goes  into  the 
point  (|  V3  ,  \)  and  the  point  (0, 1)  goes  into  the  point  (  —  \  V3  ).  Hence, 

any  point  x  =  (x!,x2)  goes  into  the  point 


'iV3 

1 

2 

- 

X 1 

|V3  xx-\x2 

1 

2 

i\/3 

> 

x2 

K  > 

^x,  +  |V3  x2 

<  > 

Remark  2.  If  and  ®  are  linear  transformations  taking  R"  into  R",  then 
the  composition  transformation  &  °  ®  defined  by  the  relation 

®°®(x)*«(®(x)) 
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is  again  a  linear  transformation  taking  R^-^R".  To  prove  this,  observe  that 

8  °  ®  (cx)  =  6E(®  (cx))  =  6E(c®  (x))  =  c6E(®  (x)) 

=  c£B  °  ®  (x) 
and 

8  o  ®(x  +  y)  =  <*E(®(x  +  y))  =  #(®(x)  +  $(y)) 

=  (x))  +  6E(®  (y))  =  8  °  ®  (x)  +  8  o  ®  (y). 

Moreover,  it  is  a  simple  matter  to  show  (see  Exercise  15)  that  if  6E(x)  =  Ax 
and  ®  (x)  =  Bx,  then 

<£°®(x)  =  A Bx.  (4) 

Similarly,  if  8,  ® ,  and  6  are  3  linear  transformations  taking  R"  into  R", 
with  ®(x)  =  Ax,  ®(x)  =  Bx,  and  G(x)  =  Cx  then 

(ffi  °  ® )  o  G(x)  =  (AB)Cx  (5) 

and 

8  °  (®  °  G)(x) = A(BC)x.  (6) 

Now,  clearly,  (8  °  ® )  °  6(x)  =  8  °  (®  °  S)(x).  Hence, 

(AB)Cx  =  A(BC)x 

for  all  vectors  x  in  R".  This  implies  (see  Exercise  14)  that 

(AB)C  =  A(BC) 

for  any  three  n  X  n  matrices  A,  B,  and  C. 

In  most  applications,  it  is  usually  desirable,  and  often  absolutely  essen¬ 
tial,  that  the  inverse  of  a  linear  transformation  exists.  Heuristically,  the  in¬ 
verse  of  a  transformation  8  undoes  the  effect  of  8,.  That  is  to  say,  if  &(x) 
=  y,  then  the  inverse  transformation  applied  to  y  must  yield  x.  More  pre¬ 
cisely,  we  define  dE-1(y)  as  the  unique  element  x  in  Rn  for  which  ffi(x)  =  y. 
Of  course,  the  transformation  8~ 1  may  not  exist.  There  may  be  some  vec¬ 
tors  y  with  the  property  that  for  all  x  in  R".  Or,  there  may  be 

some  vectors  y  which  come  from  more  than  one  x,  that  is,  ^(x^^y  and 
ff(x2)=y.  In  both  these  cases,  the  transformation  &  does  not  possess  an 
inverse.  In  fact,  it  is  clear  that  8  possesses  an  inverse,  which  we  will  call 
8~x  if,  and  only  if,  the  equation  6E(x)  =  y  has  a  unique  solution  x  for  every 
y  in  R".  In  addition,  it  is  clear  that  if  8  is  a  linear  transformation  and  8~x 
exists,  then  8~x  must  also  be  linear.  To  prove  this,  observe  first  that 
&~x{cy)  =  c8~x{y)  since  8(cx)  =  cy  if  ffi(x)  =  y.  Second,  observe  that 
8~ 1  (y 1  +  y2)  =  #" 1  (y  *)  +  1  (y2)  Since  6E(x1+x2)  =  y1+y2  if  ®(x1)  =  y1  and 
#(x2)  =  y2.  Thus  8~x,  if  it  exists,  must  be  linear. 

At  this  point  the  reader  should  feel  that  there  is  an  intimate  connection 
between  the  linear  transformation  8~x  and  the  matrix  A-1.  This  is  the 
content  of  the  following  lemma. 
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Lemma  2.  Let  A  be  an  nXn  matrix  and  let  &  be  the  linear  transformation 
defined  by  the  equation  6B(x)  =  Ax.  Then,  &  has  an  inverse  if  and  only  if 
the  matrix  A  has  an  inverse.  Moreover ,  //A-1  exists ,  then  ®-1(x)  =  A-1x. 

Proof.  Suppose  that  1  exists.  Clearly, 

&  ©  ®“1(x)  =  ff"1  °#(x)  =  x  (7) 

and  1  is  linear.  Moreover,  there  exists  a  matrix  B  with  the  property  that 
ff-1(x)  =  Bx.  Therefore,  from  (4)  and  (7) 

ABx  =  BAx  =  x 

for  all  x  in  R".  But  this  immediately  implies  (see  Exercise  14)  that 


AB  =  BA  =  I. 


Hence  B  =  A  *. 

Conversely,  suppose  that  A-1  exists.  Then,  the  equation 


6E(x)  =  Ax  =  y 


has  a  unique  solution  x  =  A  ly  for  all  y  in  R".  Thus,  A  1  also  exists,  and 

ff-,(x)  =  A",x.  □ 

We  are  now  ready  to  relate  the  problem  of  determining  whether  the  col¬ 
umns  of  an  n  X  n  matrix  A  are  linearly  dependent  or  linearly  independent 
to  the  much  simpler  problem  of  determining  whether  the  determinant  of  A 
is  zero  or  nonzero. 

Lemma  3.  The  columns  of  an  nXn  matrix  A  are  linearly  independent  if,  and 
only  if,  detA^O. 

Proof.  We  prove  Lemma  3  by  the  following  complex,  but  very  clever 
argument. 

(1)  The  columns  of  A  are  linearly  independent  if,  and  only  if,  the  equation 
Ax  =  b  has  a  unique  solution  x  for  every  b  in  R”.  This  statement  is  just 
a  reformulation  of  Theorem  9. 

(2)  From  the  remarks  preceding  Lemma  2,  we  conclude  that  the  equation 
Ax  =  b  has  a  unique  solution  x  for  every  b  if,  and  only  if,  the  linear 
transformation  ffi(x)  =  Ax  has  an  inverse. 

(3)  From  Lemma  2,  the  linear  transformation  6E  has  an  inverse  if,  and 
only  if,  the  matrix  A-1  exists. 

(4)  Finally,  the  matrix  A-1  exists  if,  and  only  if,  detA^O.  This  is  the  con¬ 

tent  of  Theorem  8,  Section  3.6.  Therefore,  we  conclude  that  the  col¬ 
umns  of  A  are  linearly  independent  if,  and  only  if,  det  At^O.  □ 

We  summarize  the  results  of  this  section  by  the  following  theorem. 
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Theorem  11.  The  equation  Ax=b  has  a  unique  solution  x  =  A  ]b  if  detA^ 
0.  The  equation  Ax=b  has  either  no  solutions ,  or  infinitely  many  solutions 
if  detA  =  0. 

Proof.  Theorem  1 1  follows  immediately  from  Theorem  9  and  Lemma  3. 

□ 


Corollary.  The  equation  Ax=0  has  a  nontrivial  solution  ( that  is,  a  solution 


with  not  all  the  xt  equal  to  zero )  if,  and  only  if,  detA  =  0. 


Proof.  Observe  that 


is  always  one  solution  of  the  equation  Ax  =  0.  Hence,  it  is  the  only  solution 
if  det  A  7^0.  On  the  other  hand,  there  exist  infinitely  many  solutions  if  det  A 
=  0,  and  all  but  one  of  these  are  nontrivial.  □ 


Example  6.  For  which  values  of  X  does  the  equation 

\  X  X' 

1  0  1  x=0 

U  i  iJ 

have  a  nontrivial  solution? 

Solution. 

1  X  X' 

det  1  0  1  =X  +  X-  1  -X-X—  1. 

U  1  1. 

Hence,  the  equation 

1  X  X' 

1  0  1  x=0 

U  i  iJ 

has  a  nontrivial  solution  if,  and  only  if,  X=  1. 

Remark  1.  Everything  we’ve  said  about  the  equation  Ax  =  b  applies  equally 
well  when  the  elements  of  A  and  the  components  of  x  and  b  are  complex 
numbers.  In  this  case,  we  interpret  x  and  b  as  vectors  in  C",  and  the  matrix 
A  as  inducing  a  linear  transformation  of  Cn  into  itself. 


Remark  2.  Suppose  that  we  seek  to  determine  n  numbers  xl,x2,-..,xn.  Our 
intuitive  feeling  is  that  we  must  be  given  n  equations  which  are  satisfied  by 
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these  unknowns.  This  is  certainly  the  case  if  we  are  given  n  linear  equa¬ 
tions  of  the  form 


ajlxl  +  aJ2x2+ ...  +aJnx„  =  bp  j=\,2,...,n 
and  detA^O,  where 


A= 


a 


11 


a 


1  n 


ani  ...  ann 


On  the  other  hand,  our  intuition  would  seem  to  be  wrong  when  detA  =  0. 
This  is  not  the  case,  though.  To  wit,  if  detA  =  0,  then  the  columns  of  A  are 
linearly  dependent.  But  then  the  columns  of  Ar,  which  are  the  rows  of  A, 
are  also  linearly  dependent,  since  detAr=detA.  Consequently,  one  of  the 
rows  of  A  is  a  linear  combination  of  the  other  rows.  Now,  this  implies  that 
the  left-hand  side  of  one  of  the  equations  (9),  say  the  kth  equation,  is  a  lin¬ 
ear  combination  of  the  other  left-hand  sides.  Obviously,  the  equation  Ax  = 
b  has  no  solution  if  bk  is  not  the  exact  same  linear  combination  of 
bl,...,bk_l,bk+l,...,bn.  For  example,  the  system  of  equations 


xx-x2+  *3=1 

x,  +  x2+  *3=1 


2x, 


+  2x3  =  3 


obviously  has  no  solution.  On  the  other  hand,  if  bk  is  the  same  linear  com¬ 
bination  of  b{9...,bk_l,bk+l,...,bn,  then  the  A:th  equation  is  redundant.  In 
this  case,  therefore,  we  really  have  only  n  —  1  equations  for  the  n  unknowns 


Remark  3.  Once  we  introduce  the  concept  of  a  linear  transformation  we 
no  longer  need  to  view  an  n  X  n  matrix  as  just  a  square  array  of  numbers. 
Rather,  we  can  now  view  an  n  X  n  matrix  A  as  inducing  a  linear  transfor¬ 
mation  6E(x)  =  Ax  on  R".  The  benefit  of  this  approach  is  that  we  can  derive 
properties  of  A  by  deriving  the  equivalent  properties  of  the  linear  transfor¬ 
mation  &.  For  example,  we  showed  that  (AB)C  =  A(BC)  for  any  3  nXn 
matrices  A,  B,  and  C  by  showing  that  the  induced  linear  transformations 
® ,  and  Q  satisfy  the  relation  (®°®)oS  =  4°(9  °S).  Now,  this  re¬ 
sult  can  be  proven  directly,  but  it  requires  a  great  deal  more  work  (see  Ex¬ 
ercise  24). 


Exercises 


In  each  of  Problems  1-3  find  all  vectors  b  for  which  the  given  system  of 
equations  has  a  solution. 


/ 3  "I  1\ 

1.11  1  1  U  =  b 

\5  1  3/ 


1  2 

2  4 

3  6 
[4  8 


2 

4 

6 

8 


6 

9 


x=b 
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3. 


11 

1 

3 


1H 


In  each  of  Problems  4-9,  find  all  solutions  of  the  given  system  of  equa¬ 
tions. 


/ 1  2 
2  3 
\l  3 


-1  |x  =  0 

10/ 


5. 


1 

-3 

6 


u 

fl 

o 

2 

12 


1  1  0 

1  1  1 

1-1  1 

1  1  -lj 


2 

1 

0 

4 


3  4] 
1  6 
0  1 
6  8. 


9. 


1  1 

1  -1 

1  1 


- 


In  each  of  Problems  10-12,  determine  all  values  of  X  for  which  the  given 
system  of  equations  has  a  nontrivial  solution. 


/  X  1  3\ 

10.  1X3  x  =  0 

\-l  3  X/ 


11. 


1 

1 

0 

1 


1 

1 

1 

0 


X 

X 

0 

1 


1 

X 

1 

1 


x  =  0 


/X  -1  -1\ 

12.  2  -1  0  |x  =  0. 

\  3  X  5/ 


13.  (a)  For  which  value  of  X  does  the  system  of  equations 


have  a  solution? 

(b)  Find  all  solutions  for  this  value  of  X. 

14.  Suppose  that  Ax  =  Bx  for  all  vectors 


Prove  that  A  =  B. 


15.  Let  &  and  ®  be  two  linear  transformations  taking  R"  into  R".  Then,  there  exist 
n  X  n  matrices  A  and  B  such  that  £(x)  =  Ax  and  ®  (x)  =  Bx.  Show  that  &  °  ®  (x) 
=  ABx.  Hint :  &  °  ®  is  a  linear  transformation  taking  R"  into  R".  Hence,  there 
exists  an  nXn  matrix  C  such  that  &  °  ®  (x)  =  Cx.  The  y'th  column  of  C  is 
&  °  ®  (e7).  Thus  show  that  &  °  ®  (e  j)  is  the  yth  column  of  the  matrix  AB. 
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16.  Let  8  be  a  linear  transformation  taking  R^R".  Show  that  8(0) =0. 


17.  Let  9,(9)  be  the  linear  transformation  which  rotates  each  point  in  the  plane  by 
an  angle  6  in  the  counterclockwise  direction.  Show  that 


18.  Let  %  and  9,2  be  the  linear  transformations  which  rotate  each  point  in  the 
plane  by  angles  0j  and  02  respectively.  Then  the  linear  transformation  9t3  = 
9,\  °  9?  rotates  each  point  in  the  plane  by  an  angle  9X  +  02  (in  the  counterclock¬ 
wise  direction).  Using  Exercise  (15),  show  that 

/  cos(0!  +  02)  —  sin(^1  +  ^2)\  /cos^j  —  sin#!  \/cos02  —  sin02\ 

\  sin(0,  -I-  02 )  cos(0,  +  02 )  /  \  sin0,  cos0, ) \  sin 02  cos02  )' 

Thus,  derive  the  trigonometric  identities 

sin(0!  +  02  )  =  sin 0X  cos  02  +  cos  9X  sin  02 
cos  (9 1  +  02  )  =  cos#!  cos  02  —  sin#!  sin#2. 


19.  Let 

(a)  Verify  that  ft  is  linear. 

(b)  Show  that  every  point  (x,,x2)  on  the  unit  circle  x\  +  x2  =  1  goes  into  a  point 
on  the  circle,  x\  -I-  x2  =  2. 

20.  Let  V  be  the  space  of  all  polynomials  p(t)  of  degree  less  than  or  equal  to  3  and 
let  (Dp)(t)  =  dp(t) / dt. 

(a)  Show  that  D  is  a  linear  transformation  taking  V  into  V. 

(b)  Show  that  D  does  not  possess  an  inverse. 

21.  Let  V  be  the  space  of  all  continuous  functions  /(/),  —  oo  <  /  <  oo  and  let  (Kf)(t) 

=Slmds- 

(a)  Show  that  AT  is  a  linear  transformation  taking  V  into  V. 

(b)  Show  that  (DK)f=f  where  Df= /'. 

(c)  Let  / (/)  be  differentiable.  Show  that 

22.  A  linear  transformation  &  is  said  to  be  1  - 1  if  ®(x)^fi(y)  whenever  x^y.  In 
other  words,  no  two  vectors  go  into  the  same  vector  under  ft.  Show  that  ft  is 
1  —  1  if,  and  only  if,  ft(x)=  0  implies  that  x=0. 

23.  A  linear  transformation  ft  is  said  to  be  onto  if  the  equation  ft(x)=y  has  at  least 
one  solution  for  every  y  in  R71.  Prove  that  ft  is  onto  if,  and  only  if,  ft  is  1  —  1. 
Hint :  Show  first  that  ft  is  onto  if,  and  only  if,  the  vectors  ft(e1),...,ft(en)  are 
linearly  independent.  Then,  use  Lemma  1  to  show  that  we  can  find  a  nonzero 
solution  of  the  equation  ft(x)=0  if  ftfc1), . . . ,  ft(e")  are  linearly  dependent.  Fi¬ 
nally,  show  that  ft(e*), . . . ,  ft(ew)  are  linearly  dependent  if  the  equation  ft(x)=0 
has  a  nonzero  solution. 
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24.  Prove  directly  that  (AB)C  =  A(BQ.  Hint:  Show  that  these  matrices  have  the 
same  elements. 


25.  Show  that 


1 

1 

2 


+  c2 


if,  and  only  if,  b2  =  bx  +  b2. 


3.8  The  eigenvalue-eigenvector  method 
of  finding  solutions 

We  return  now  to  the  first-order  linear  homogeneous  differential  equation 


f  ~ 
a\\ 

...  a^n 

x  =  Ax,  x  = 

kl 

ii 

< 

rs 

• 

• 

• 

an\ 

• 

• 

• 

•  *  *  ®nn  ^ 

Our  goal  is  to  find  n  linearly  independent  solutions  x1(/),.-*,x',(/).  Now, 
recall  that  both  the  first-order  and  second-order  linear  homogeneous  scalar 
equations  have  exponential  functions  as  solutions.  This  suggests  that  we  try 
x(/)  =  ex'v,  where  v  is  a  constant  vector,  as  a  solution  of  (1).  To  this  end, 
observe  that 

eXt\  =  XeXty 
dt 

and 

A(eX/v)  =  eXt  Ay. 

Hence,  x(/)=  eXtY  is  a  solution  of  (1)  if,  and  only  if,  XeXt\  =  eXtA\.  Dividing 
both  sides  of  this  equation  by  eXt  gives 

Av  =  Av.  (2) 

Thus,  x(t)  =  eXt\  is  a  solution  of  (1)  if,  and  only  if,  X  and  v  satisfy  (2). 

Definition.  A  nonzero  vector  v  satisfying  (2)  is  called  an  eigenvector  of  A 
with  eigenvalue  X. 

Remark.  The  vector  v  =  0  is  excluded  because  it  is  uninteresting.  Obvi¬ 
ously,  AO  =  X  •  0  for  any  number  X. 

An  eigenvector  of  a  matrix  A  is  a  rather  special  vector:  under  the  linear 
transformation  x-*Ax,  it  goes  into  a  multiple  X  of  itself.  Vectors  which  are 
transformed  into  multiples  of  themselves  play  an  important  role  in  many 
applications.  To  find  such  vectors,  we  rewrite  Equation  (2)  in  the  form 

0  =  Av  —  Av  =  (A  —  AI)v.  (3) 
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But,  Equation  (3)  has  a  nonzero  solution  v  only  if  det(A-AI)  =  0.  Hence 
the  eigenvalues  A  of  A  are  the  roots  of  the  equation 

*ln 
<*2n 

.  5 

a  —A 

nn 

J 

and  the  eigenvectors  of  A  are  then  the  nonzero  solutions  of  the  equations 
(A  —  AI)v  =  0,  for  these  values  of  A. 

The  determinant  of  the  matrix  A  —  AI  is  clearly  a  polynomial  in  A  of  de¬ 
gree  n9  with  leading  term  (—  ly’A".  It  is  customary  to  call  this  polynomial 
the  characteristic  polynomial  of  A  and  to  denote  it  by  p( A).  For  each  root 
Xj  of  p( A),  that  is,  for  each  number  Xj  such  that  p(Xj)  =  0 ,  there  exists  at  least 
one  nonzero  vector  yj  such  that  A\j  =  Xj\J.  Now,  every  polynomial  of  de¬ 
gree  n  >  1  has  at  least  one  (possibly  complex)  root.  Therefore,  every  matrix 
has  at  least  one  eigenvalue,  and  consequently,  at  least  one  eigenvector.  On 
the  other  hand,  p{ A)  has  at  most  n  distinct  roots.  Therefore,  every  nXn 
matrix  has  at  most  n  eigenvalues.  Finally,  observe  that  every  nXn  matrix 
has  at  most  n  linearly  independent  eigenvectors,  since  the  space  of  all  vec¬ 
tors 

r  *\ 

»i 

v=  : 

IV 

has  dimension  n. 

Remark.  Let  v  be  an  eigenvector  of  A  with  eigenvalue  A.  Observe  that 

A(cv)  =  cAv  =  cAv  =  A(cv) 

for  any  constant  c.  Hence,  any  constant  multiple  (ct^O)  of  an  eigenvector 
of  A  is  again  an  eigenvector  of  A,  with  the  same  eigenvalue. 

For  each  eigenvector  yJ  of  A  with  eigenvalue  A.,  we  have  a  solution  xHt) 
=  e  j‘ V  of  (1).  If  A  has  n  linearly  independent  eigenvectors  v1,...,v'1  with 
eigenvalues  Xlf...,X„  respectively  (A,,...^  need  not  be  distinct),  then  xJ(t) 
—  ekJtyjJ=  l,...,rt  are  n  linearly  independent  solutions  of  (1).  This  follows 
immediately  from  Theorem  6  of  Section  3.4  and  the  fact  that  xy  (0)  =  vy.  In 
this  case,  then,  every  solution  x(t)  of  (1)  is  of  the  form 

x(?)  =  C|eA''v'  +  c2eX2'v2+  ...  +c„ex-V.  (4) 

This  is  sometimes  called  the  “general  solution”  of  (1). 

The  situation  is  simplest  when  A  has  n  distinct  real  eigenvalues 
A1,A2,...,A/i  with  eigenvectors  yl,y2,...,yn  respectively,  for  in  this  case  we 


0  =  det(  A  -  AI)  =  det 


a\\  ^ 


a 


12 


a 


21 


a22  ^ 


a 


n  1 


a 


n2 
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are  guaranteed  that  v^v2,...,!^  are  linearly  independent.  This  is  the  con¬ 
tent  of  Theorem  12. 

Theorem  12.  Any  k  eigenvectors  vl, . . . ,  v*  of  A  with  distinct  eigenvalues 
Xv...,Xk  respectively ,  are  linearly  independent . 

Proof.  We  will  prove  Theorem  12  by  induction  on  k,  the  number  of  ei¬ 
genvectors.  Observe  that  this  theorem  is  certainly  true  for  k  =  1.  Next,  we 
assume  that  Theorem  12  is  true  for  k—j.  That  is  to  say,  we  assume  that 
any  set  of  j  eigenvectors  of  A  with  distinct  eigenvalues  is  linearly  indepen¬ 
dent.  We  must  show  that  any  set  of  j  +  1  eigenvectors  of  A  with  distinct  ei¬ 
genvalues  is  also  linearly  independent.  To  this  end,  let  v1,...,v/+1  be  j  + 1 
eigenvectors  of  A  with  distinct  eigenvalues  \l9...,XJ+l  respectively.  To  de¬ 
termine  whether  these  vectors  are  linearly  dependent  or  linearly  indepen¬ 
dent,  we  consider  the  equation 

c1v1  +  c2v2+  ...  +cJ+l\J+l=0.  (5) 

Applying  A  to  both  sides  of  (5)  gives 

Viv‘+V2v2+  •••  +Vic;+iv'/+1=0-  (6) 

Thus,  if  we  multiply  both  sides  of  (5)  by  Xl  and  subtract  the  resulting  equa¬ 
tion  from  (6),  we  obtain  that 

(A2-Ai)c2v2+  ...  +(Ay+1-A,)cy+Iv/+1  =  0.  (7) 

But  v2,...,v7+1  are  j  eigenvectors  of  A  with  distinct  eigenvalues  A2,...,Ay+1 
respectively.  By  the  induction  hypothesis,  they  are  linearly  independent. 
Consequently, 


(A2  Ai)c2  — 0,  (X^  ^1)^3 0, . . . ,  and  A|^ty_j.j“ -0. 

Since  A1?  A2, . . . ,  A >-+l  are  distinct,  we  conclude  that  c2, c3,...,cj+l  are  all  zero. 
Equation  (5)  now  forces  cx  to  be  zero.  Hence,  v1,v2,...,v/+1  are  linearly  in¬ 
dependent.  By  induction,  therefore,  every  set  of  k  eigenvectors  of  A  with 
distinct  eigenvalues  is  linearly  independent.  □ 


Example  1.  Find  all  solutions  of  the  equation 


-1 


x. 


Solution.  The  characteristic  polynomial  of  the  matrix 


2  -1 
1  -lJ 
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is 

1-A  -1  4 

p  (X)  =  det(A  -  XI)  =  det  3  2-A  -1 

1  2  1  —  1  -A. 

=  -(1  +  A)(1  -A)(2  — A)  +  2+ 12  —  8(2  —  A)  +  ( 1  —  A)  —  3(1  +A) 

=  (1  —  A)(A  — 3)(A  +  2). 

Thus  the  eigenvalues  of  A  are  A,=  1,  A2  =  3,  and  A3=  —  2. 

(i)  Aj=  1:  We  seek  a  nonzero  vector  v  such  that 

0  -1  4]fthl  [O' 

(A  —  I)v  =  3  1  -1  ®2  -  0  • 

2  1  -2JKJ  loj 

This  implies  that 

—  v2  +  4v3  =  0,  3uj  +  v2  —  v3  =  0,  and  2vl  +  v2  —  2v3  =  0. 

Solving  for  c,  and  v2  in  terms  of  v3  from  the  first  two  equations  gives  c,  = 
— 1>3  and  v2  =  4t>3.  Hence,  each  vector 

-r 

v  =  c  4 

.  1. 

is  an  eigenvector  of  A  with  eigenvalue  one.  Consequently, 

f-f 

cel  4 

.  1. 

is  a  solution  of  the  differential  equation  for  any  constant  c.  For  simplicity, 
we  take 

-1' 

xl(t)  =  e‘  4  ■ 

.  1. 

(ii)  A2  =  3:  We  seek  a  nonzero  vector  v  such  that 

-2  -1  4l  [cil  [O' 

(A  -  3I)v  =  3  -1  -1  v2  =  0  ■ 

2  1  —4  0 

This  implies  that 

—  2vt  —  v2  +  4v3  =  0,  3v{  —  v2~  v3  =  0,  and  2vl  +  v2  —  4v3=:0. 

Solving  for  and  v2  in  terms  of  v3  from  the  first  two  equations  gives  u1  = 
t>3  and  v2  =  2v3.  Consequently,  each  vector 

T 

v=  c  2 
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is  an  eigenvector  of  A  with  eigenvalue  3.  Therefore, 

rn 

x2{t)  =  e3‘  2 
.1. 

is  a  second  solution  of  the  differential  equation. 

(iii)  X3=  —2:  We  seek  a  nonzero  vector  v  such  that 

3  _1  4]  r^i ]  [o' 

(A  +  2I)v  =  3  4  -1  ®2  =  0  • 

U  1  1 J  L^aJ  10. 

This  implies  that 

3t>,  —  v2+4v3  =  0,  3vt+4v2  —  v3  =  0  and  2v,  +  v2  +  v3=0. 

Solving  for  vx  and  v2  in  terms  of  v3  gives  u,  =  —  v3  and  v2  =  v3.  Hence,  each 
vector 

-r 

v  =  c  1 

.  1. 

is  an  eigenvector  of  A  with  eigenvalue  -  2.  Consequently, 

-l' 

x\t)  =  e~2‘  1 

.  1. 

is  a  third  solution  of  the  differential  equation.  These  solutions  must  be  lin¬ 
early  independent,  since  A  has  distinct  eigenvalues.  Therefore,  every  solu¬ 
tion  x(t)  must  be  of  the  form 

'-ll  f1]  f_1 

x(t)  =  cle‘  4  +c2e3'  2  +c3e~2’  1 

.  lJ  UJ  l  1. 

—  cte*+  c2e3t  —  c3e~2t 
—  4clet +  2c2e3t  +  c3e~2t  . 

6^'+  c2e3t  +  c3e~2t 

Remark.  If  X  is  an  eigenvalue  of  A,  then  the  n  equations 
0,1^1+  •••  ...  +aJnvn  =  0,  j=  1 

are  not  independent;  at  least  one  of  them  is  a  linear  combination  of  the 
others.  Consequently,  we  have  at  most  n  —  1  independent  equations  for  the 
n  unknowns  vl9...,vn.  This  implies  that  at  least  one  of  the  unknowns 
vv...,vn  can  be  chosen  arbitrarily. 
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Example  2.  Solve  the  initial-value  problem 


Solution.  The  characteristic  polynomial  of  the  matrix 


p  (X)  =  det(  1  - ' X  J  !_\  )  -  ( 1  -  X)2  -  36  =  (X  -  7)(X + 5). 


Thus,  the  eigenvalues  of  A  are  \x  =  7  and  \2  =  —5. 
(i)  Xx  =  7:  We  seek  a  nonzero  vector  v  such  that 


This  implies  that  vx  =  2v2-  Consequently,  every  vector 


is  an  eigenvector  of  A  with  eigenvalue  7.  Therefore, 

is  a  solution  of  the  differential  equation. 

(ii)  X2  =  —5:  We  seek  a  nonzero  vector  v  such  that 

<a+5,H3  'ivm 

This  implies  that  =  —2v2.  Consequently, 

(-?) 


is  an  eigenvector  of  A  with  eigenvalue  —  5,  and 


x’(«)-e-"(-2) 

is  a  second  solution  of  the  differential  equation.  These  solutions  are  lin¬ 
early  independent  since  A  has  distinct  eigenvalues.  Hence,  x(f)  =  c1x1(f)  + 
c2x2(t).  The  constants  cx  and  c2  are  determined  from  the  initial  condition 
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Thus,  2cj—  2c2=0  and  cx  +  c2=\.  The  solution  of  these  two  equations  is 
cx  =  \  and  c2  =  Consequently, 


\e7‘+L-5‘ 
2  2 


Example  3.  Find  all  solutions  of  the  equation 


x  =  Ax  = 


1 

2 

3 

4 


1  1  1 
2  2  2 

3  3  3 

4  4  4 

5  5  5 


1 

2 

3 

4 


x. 


Solution.  It  is  not  necessary  to  compute  the  characteristic  polynomial  of  A 
in  order  to  find  the  eigenvalues  and  eigenvectors  of  A.  To  wit,  observe  that 


r  -n 

X, 

f  N 

1 

x2 

2 

X3 

=  (xx  +  x2  +  x3  +  x4  +  x5) 

3 

*4 

4 

*5 

V.  J 

5 

k.  J 

Hence,  any  vector  x  whose  components  add  up  to  zero  is  an  eigenvector  of 
A  with  eigenvalue  0.  In  particular 


v'  = 

r 

i 

0 

0 

,  V*  = 

r  "n 

0 

i 

0 

s# 

u> 

II 

0 

0 

1 

,  and  v4  = 

- > 

o  o  o 

0 

-1 

L  J 

0 

-1 

J 

0 

-1 

k.  J 

1 

-1 

L  J 

are  four  independent  eigenvectors  of  A  with  eigenvalue  zero.  Moreover, 
observe  that 


v 


1 

2 

3 

4 

5 


J 


is  an  eigenvector  of  A  with  eigenvalue  15  since 


r  > 

i 

r  > 

i 

f 

1 

2 

2 

2 

A 

3 

=  (14-2  +  3  +  4  +  5) 

3 

=  15 

3 

4 

4 

4 

[5  J 

5 

^  J 

5j 
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The  five  vectors  v1,  v2,  v3,  v4,  and  v5  are  easily  seen  to  be  linearly  indepen¬ 
dent.  Hence,  every  solution  x(7)  is  of  the  form 


f  "\ 

1 

f  *N 

0 

r  > 

0 

r 

0 

1 

0 

1 

0 

0 

+  c5e15' 

2 

0 

+  c2 

0 

+  c3 

1 

+  c4 

0 

3 

0 

0 

0 

1 

4 

-1 

V.  J 

-1 

J 

-1 

C  J 

.-1. 

.5J 

Exercises 


In  each  of  Problems  1-6  find  all  solutions  of  the  given  differential  equa¬ 
tion. 

'•*-(2  "i)” 

3  2  4 

3.  x=|  2  0  2  |x 

4  2  3 

-7  0  6 

5.  x=|  0  5  0  lx 

6  0  2 


4.  x= 


6.  x= 


fl 
3 


2 

6 

10 

14 


3 

9 

15 

21 


6 

18 

30 

42 


In  each  of  Problems  7-12,  solve  the  given  initial- value  problem. 


S.x=(_ 1  3)X)  X(0)  =  (0J 


7.x=(J  ])x,  x(0)  =  (2) 

(3  1  -1\ 

9.  x=|  1  3  -1  )x,  x(0)  = 

3  3-1 


1  -1  0 

10.  x=|  1  2  1  |x,  x(0) 

1  10  2 


ojx,  x(0)=|  oj 

12.  x=|  -1  2  ljx,  x(0)=|  4 j 

13.  (a)  Show  that  ex^~to\,t0  constant,  is  a  solution  of  x= Ax  if  Av=Av. 
(b)  Solve  the  initial-value  problem 


x  = 


(see  Exercise  12). 
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14.  Three  solutions  of  the  equation  x  =  Ax  are 


el  +  e2t 

el  +  e3t 

e*  —  e ^ 

e2t 

y 

e3t 

and 

—  e3' 

0 

V.  J 

[  e3‘  \ 

—  e3'  J 

Find  the  eigenvalues  and  eigenvectors  of  A. 

15.  Show  that  the  eigenvalues  of  A-1  are  the  reciprocals  of  the  eigenvalues  of  A. 

16.  Show  that  the  eigenvalues  of  An  are  the  nth  power  of  the  eigenvalues  of  A. 

17.  Show  that  A  =  0  is  an  eigenvalue  of  A  if  detA  =  0. 

18.  Show,  by  example,  that  the  eigenvalues  of  A  +  B  are  not  necessarily  the  sum  of 
an  eigenvalue  of  A  and  an  eigenvalue  of  B. 

19.  Show,  by  example,  that  the  eigenvalues  of  AB  are  not  necessarily  the  product 
of  an  eigenvalue  of  A  with  an  eigenvalue  of  B. 

20.  Show  that  the  matrices  A  and  T-1AT  have  the  same  characteristic  polynomial. 

21.  Suppose  that  either  B-1  or  A-1  exists.  Prove  that  AB  and  BA  have  the  same 
eigenvalues.  Hint:  Use  Exercise  20.  (This  result  is  true  even  if  neither  B'  ' 1  or 
A-1  exist;  however,  it  is  more  difficult  to  prove  then.) 


3.9  Complex  roots 

If  A  =  a  +  if}  is  a  complex  eigenvalue  of  A  with  eigenvector  v= v1  +  /v2,  then 
x(t)  =  eXt\  is  a  complex-valued  solution  of  the  differential  equation 

x  =  Ax.  (1) 

This  complex-valued  solution  gives  rise  to  two  real-valued  solutions,  as  we 
now  show. 

Lemma  1.  Let  x(t)  =  y(t)  +  iz(t)  be  a  complex-valued  solution  of  ( 1).  Then, 
both  y (/)  and  z (/)  are  real-valued  solutions  of  (1). 

Proof.  If  x(0  =  y(0  +  *z(0  is  a  complex-valued  solution  of  (1),  then 

y  (t)  +  iz(t)  =  A(y(t)  +  iz(t))  =  Ay(f)  +  iAz(t).  (2) 

Equating  real  and  imaginary  parts  of  (2)  gives  y(t)  =  Ay(t)  and  z(t)  =  Az(t). 
Consequently,  both  y(f)  =  Re{x(0}  and  z(t)  =  Im{x(/)}  are  real-valued 
solutions  of  (1).  □ 

The  complex-valued  function  x(f)  =  e(a+,'^)/(v1  +  /v2)  can  be  written  in 
the  form 

x(  t)  =  eat  (cos  fit  +  i  sin  4-  i\2) 

=  e  at  [  (v1  cos  fit  —  v2  sin  (it)  +  /  (v1  sin  fit  +  v2  cos  /?/)  ] . 
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Hence,  if  X  =  a  +  ifi  is  an  eigenvalue  of  A  with  eigenvector  v  =  v1  +  /v2,  then 

y  (t)  =  eat  (yl  cos  fit  —  v2  sin  fit) 
and 


z  (t)  =  e  at  ( v1  sin  fit  +  v2  cos  fit) 

are  two  real-valued  solutions  of  (1).  Moreover,  these  two  solutions  must  be 
linearly  independent  (see  Exercise  10). 


Example  1.  Solve  the  initial- value  problem 


'1 

0 

°1 

\l) 

• 

x  = 

0 

1 

-1 

x,  x(0) = 

1 

L0 

1 

1. 

Li  J 

Solution .  The  characteristic  polynomial  of  the  matrix 


A  = 


0 

0 


0 

1 

1 


0 


is 


p  (X)  =  det(  A  —  XI)  =  det 


1-X  0  0 

0  1-X  -1 

0  1  1-X 


=  (1  -A)3  +  (l  — A)  =  (l  — A)(A2  — 2A  +  2). 
Hence  the  eigenvalues  of  A  are 


•\  -i  j  2±  V4-8 

X=  1  and  X  = - - - =  1  ±  /. 


(i)X=l:  Clearly, 


is  an  eigenvector  of  A  with  eigenvalue  1.  Hence 


0 


is  one  solution  of  the  differential  equation  x=Ax. 

(ii)  X=  1  +  /:  We  seek  a  nonzero  vector  v  such  that 


[A-(l  +  0l]v  = 


—  i 
0 
0 


0 
—  i 
1 


0 

1 


\ 

■®r 

fol 

= 

0 

J 

V3 

<  **  J 

.0. 
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This  implies  that  —ivx  —  0,  —  iv2  —  v3  =  0,  and  v2—iv 3  =  0.  The  first  equa¬ 
tion  says  that  vx  =  0  and  the  second  and  third  equations  both  say  that  v2  = 
ivy  Consequently,  each  vector 

[O' 

\=c  i 

A. 

is  an  eigenvector  of  A  with  eigenvalue  1  +  i.  Thus, 

O' 

x(/)  =  e(1  +  ,)'  i 

A. 

is  a  complex-valued  solution  of  the  differential  equation  x  =  Ax.  Now, 

oi  fro]  roi 

e(l  +  i)t  i  =  e'(cos/  +  /sin/)  0  +/  1 

A)  [UJ  lOJ 

roi  roi  roi  roi 

=  et  cos t  0  —sin t  1  +  /sin/  0  +  /cos/  1 

L 1 J  LOJ  1 1 J  LOJ 

r  o  l  o  ' 

=  el  -sin t  +ie‘  cos t  • 

cos  t  sin/ 

V  J  V.  J 

Consequently,  by  Lemma  1, 

°  1  0  ' 
x2(t)  =  e'  -sin/  and  x3(t)  =  e‘  cost 

cos  t  sin/ 

v  y  V  y 

are  real-valued  solutions.  The  three  solutions  x'(/),  x2(z),  and  x3(t)  are  lin¬ 
early  independent  since  their  initial  values 

'ii  roi  ro' 

xl(0)=  0  ,  x2(0)=  1  ,  and  x3(0)=  0 

.OJ  [oj  [1. 

are  linearly  independent  vectors  in  R3.  Therefore,  the  solution  x(t)  of  our 
initial-value  problem  must  have  the  form 

I'll  °  1  0  ' 

x(t)  =  cle‘  0  +c2e‘  -sin/  +c3el  cost  . 

.Oj  [  cost  J  [sin/. 

Setting  /  =  0,  we  see  that 

'ii  rn  roi  roi  rcr 

1  =C,  0  +c2  0  +c3  1  =  c3  . 

1  0  1  0  c2 

v  -/  V.  y  v  y  v,  y  v,  ^  J 
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Consequently  cx  =  c2  —  c3  =  1  and 


■1] 

f 

0  ] 

0 ' 

\(t)  =  e‘ 

0 

+  e‘ 

—  sin  t 

+  ef 

cos  t 

0 

»  j 

cost 

.sin*. 

r 

1 

* 

=  et 

cos^  —  sin^ 

• 

cos /  + sin/ 

V  J 

Remark.  If  v  is  an  eigenvector  of  A  with  eigenvalue  A, _then  v,  the  complex 
conjugate  of  v,  is  an  eigenvector  of  A  with  eigenvalue  A.  (Each  component 
of  v  is  the  complex  conjugate  of  the  corresponding  component  of  v.)  To 
prove  this,  we  take  complex  conjugates  of  both  sides  of  the  equation  Av  = 
Av  and  observe  that  the  complex  conjugate  of  the  vector  Av  is  Av  if  A  is 
real.  Hence,  Av  =  Av,  which  shows  that  v  is  an  eigenvector  of  A  with  eigen¬ 
value  A. 


Exercises 

In  each  of  Problems  1-4  find  the  general  solution  of  the  given  system  of 
differential  equations. 


1.  x 


1  0  0 

3.  x=  3  1  -2  x 

\  2  2  1/ 


l  -5  0 

2.  x=  1  -3  0  x 

\0  0  1/ 

/  1  0  1\ 
4.  x=  0  1  -1  x 

\  —2  0  -1/ 


In  each  of  Problems  5-8,  solve  the  given  initial-value  problem. 

5-  *=(s  -5)X’  x(0)  =  ( 2 )  X=  (4  Il)X>  ^“(s) 

/- 3  0  2\  /  0\ 

7.  x=  I  l  -l  0  )x,  x(0)  =  I  —  1  I 
\  —2  -1  0/  \ -2/ 


0 

2 

0 

O' 

['ll 

8.  x  = 

-2 

0 

0 

0 

0 

0 

0 

-3 

x, 

x(0)  = 

1 

1 

0 

0 

3 

OJ 

LoJ 

9.  Determine  all  vectors  x°  such  that  the  solution  of  the  initial-value  problem 

(l  0  — 2 \ 

x=l  0  1  0 jx,  x(0)  =  x° 

is  a  periodic  function  of  time. 


344 


3.10  Equal  roots 


10.  Let  x(t)~ex‘v  be  a  solution  of  x  =  Ax.  Prove  that  y(/)  =  Re{x(t)}  and  z (/)  = 
Im{x(7)}  are  linearly  independent.  Hint :  Observe  that  v  and  v  are  linearly  inde¬ 
pendent  in  On  since  they  are  eigenvectors  of  A  with  distinct  eigenvalues. 


3.10  Equal  roots 

If  the  characteristic  polynomial  of  A  does  not  have  n  distinct  roots,  then  A 
may  not  have  n  linearly  independent  eigenvectors.  For  example,  the  matrix 


1  O' 
1  0 
0  2. 


has  only  two  distinct  eigenvalues  Xx  =  1  and  X2  =  2  and  two  linearly  inde¬ 
pendent  eigenvectors,  which  we  take  to  be 


r 

r°' 

0 

and 

0 

.0. 

.1. 

Consequently,  the  differential  equation  x  =  Ax  has  only  pvo  linearly  inde¬ 
pendent  solutions 


T 

0 

and  e2t 

fol 

0 

.0. 

.1. 

of  the  form  eXt\.  Our  problem,  in  this  case,  is  to  find  a  third  linearly  inde¬ 
pendent  solution.  More  generally,  suppose  that  the  n  X  n  matrix  A  has  only 
k  <  n  linearly  independent  eigenvectors.  Then,  the  differential  equation 
x  =  Ax  has  only  k  linearly  independent  solutions  of  the  form  eXt\.  Our 
problem  is  to  find  an  additional  n  —  k  linearly  independent  solutions. 

We  approach  this  problem  in  the  following  ingenious  manner.  Recall 
that  x(t)  =  eatc  is  a  solution  of  the  scalar  differential  equation  x  =  ax,  for 
every  constant  c.  Analogously,  we  would  like  to  say  that  x(/)  =  eA'v  is  a 
solution  of  the  vector  differential  equation 

x  =  Ax  (1) 


for  every  constant  vector  v.  However,  eAt  is  not  defined  if  A  is  an  n  X  n 
matrix.  This  is  not  a  serious  difficulty,  though.  There  is  a  very  natural  way 
of  defining  eAt  so  that  it  resembles  the  scalar  exponential  eat\  simply  set 


.  A2/2  A  ntn 

6  *=I  + A/+  —zr. - h  . . .  4- - : —  +  . . .  . 


2! 


n\ 


It  can  be  shown  that  the  infinite  series  (2)  converges  for  all  t ,  and  can  be 
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differentiated  term  by  term.  In  particular 


si  A*+ 1 

-2-eAi  =  A  +  A2t+  ...  +  ^-r-tn+  ... 
dt  n\ 


=  A 


A  ntn 

I  +  At  +  . . .  4*  — j—  +  . . . 

n\ 


=  AeAt. 


This  implies  that  eAty  is  a  solution  of  (1)  for  every  constant  vector  v,  since 

eAt\  =  AeAty  =  A(eA'v). 


Remark.  The  matrix  exponential  eAt  and  the  scalar  exponential  eat  satisfy 
many  similar  properties.  For  example, 

( eAt )  l  =  e~At  and  (3) 

Indeed,  the  same  proofs  which  show  that  (eat)~l  —  e~at  and 
can  be  used  to  establish  the  identities  (3):  we  need  only  replace  every  a  by 
A  and  every  1  by  I.  However,  eAt+Bt  equals  eAteBt  only  if  AB  =  BA  (see  Ex¬ 
ercise  15,  Section  3.11). 


There  are  several  classes  of  matrices  A  (see  Problems  9-11)  for  which 
the  infinite  series  (2)  can  be  summed  exactly.  In  general,  though,  it  does 
not  seem  possible  to  express  eAt  in  closed  form.  Yet,  the  remarkable  fact  is 
that  we  can  always  find  n  linearly  independent  vectors  v  for  which  the  in¬ 
finite  series  eAty  can  be  summed  exactly.  Moreover,  once  we  know  n  lin¬ 
early  independent  solutions  of  (1),  we  can  even  compute  eAt  exactly.  (This 
latter  property  will  be  proven  in  the  next  section.) 

We  now  show  how  to  find  n  linearly  independent  vectors  v  for  which 
the  infinite  series  eAty  can  be  summed  exactly.  Observe  that 

eAty=e(A-\l)te\lty 

for  any  constant  X,  since  (A  —  XI)(M)  =  (XI)(A  —  XI).  Moreover, 


exit\  = 


i+xi  *+^jr"  +  — 

\2t2 

1  +  Xt  +  2|~  “h  •  •  • 


y  =  £^*y. 


Hence,  eA'v  =  eAV(A-AI)'v. 

Next,  we  make  the  crucial  observation  that  if  v  satisfies  (A— AI)mv  =  0 
for  some  integer  m,  then  the  infinite  series  e(A“XI)'v  terminates  after  m 
terms.  If  (A-XI)mv  =  0,  then  (A-AI)w+/v  is  also  zero,  for  every  positive  in¬ 
teger  /,  since 


(A  -  \l)m  +  v  =  (A  -  XI)'[  (A  -  xi)mv  ]  =  0. 
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Consequently, 

*m—  1  , 

e(A“Ai)/v  =  v  +  /(A— XI)v+  ...  4-  - - —  (A  — Xl)m  v 

(m—  1)! 


and 


— —  g\tg(A  M)/y 


=  eXt 


m-  1 


im  —  i 

v  + 1  (A — AI)v  +  . . .  +  - -  (A — XI) 

(m  1)! 


m  —  1 


This  suggests  the  following  algorithm  for  finding  n  linearly  independent 
solutions  of  (1). 

(1)  Find  all  the  eigenvalues  and  eigenvectors  of  A.  If  A  has  n  linearly  in¬ 
dependent  eigenvectors,  then  the  differential  equation  x=Ax  has  n  linearly 
independent  solutions  of  the  form  eXt\.  (Observe  that  the  infinite  series 
e(A-xi)ty  terminates  after  one  term  if  v  is  an  eigenvector  of  A  with  eigen¬ 
value  X.) 

(2)  Suppose  that  A  has  only  k<n  linearly  independent  eigenvectors. 
Then,  we  have  only  k  linearly  independent  solutions  of  the  form  eXt\.  To 
find  additional  solutions  we  pick  an  eigenvalue  X  of  A  and  find  all  vectors 
v  for  which  (A-AI)2v=0,  but  (A-XI)v^0.  For  each  such  vector  v 


£A*y=  g\tg(A  —  \l)ty  = 


f(A-XI)v] 


is  an  additional  solution  of  x  =  Ax.  We  do  this  for  all  the  eigenvalues  X  of 
A. 


(3)  If  we  still  do  not  have  enough  solutions,  then  we  find  all  vectors  v 
for  which  (A-AI)3v  =  0,  but  (A  — AI)2v^0.  For  each  such  vector  v, 


eAty  _ 


v  +  /  ( A  —  AI)v  +  (A  -  XI)2v 


is  an  additional  solution  of  x  =  Ax. 

(4)  We  keep  proceeding  in  this  manner  until,  hopefully,  we  obtain  n  lin¬ 
early  independent  solutions. 

The  following  lemma  from  linear  algebra,  which  we  accept  without 
proof,  guarantees  that  this  algorithm  always  works.  Moreover,  it  puts  an 
upper  bound  on  the  number  of  steps  we  have  to  perform  in  this  algorithm. 


Lemma  1.  Let  the  characteristic  polynomial  of  A  have  k  distinct  roots 
A p..., A*  with  multiplicities  nv...,nk  respectively.  ( This  means  that  p(X) 
can  be  factored  into  the  form  (Kx—Xfx...(\k— A)"*.)  Suppose  that  A  has 
only  Vj  <  nj  linearly  independent  eigenvectors  with  eigenvalue  A,.  Then  the 
equation  (A  — A7I)2v  =  0  has  at  least  Vj+ 1  independent  solutions.  More 
generally ,  if  the  equation  (A  —  X,I)wv  =  0  has  only  mj  <nj  independent 
solutions ,  then  the  equation  (A  — XyI)m+1v  =  0  has  at  least  mj+ 1  indepen¬ 
dent  solutions. 
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Lemma  1  clearly  implies  that  there  exists  an  integer  dj  with  dj  <  nJ9  such 
that  the  equation  (A-XJI)^v  =  0  has  at  least  /i.  linearly  independent  solu¬ 
tions.  Thus,  for  each  eigenvalue  \j  of  A,  we  can  compute  rij  linearly  inde¬ 
pendent  solutions  of  x  =  Ax.  All  these  solutions  have  the  form 


v  + 1  (A  —  AyI)v  +  . . .  + 


(4-1)! 


(A  -  \j\)dj  ‘v  . 


In  addition,  it  can  be  shown  that  the  set  of  nx+  ...  + nk  =  n  solutions  thus 
obtained  must  be  linearly  independent. 


Example  1.  Find  three  linearly  independent  solutions  of  the  differential 
equation 

1  1  O' 

x=  0  1  Ox. 

.0  0  2. 

Solution.  The  characteristic  polynomial  of  the  matrix 

110] 

A=  0  1  0 

[0  0  2J 

is  (1-A)2(2-A).  Hence  A=1  is  an  eigenvalue  of  A  with  multiplicity  two, 
and  A  =  2  is  an  eigenvalue  of  A  with  multiplicity  one. 

(i)  A  =  1:  We  seek  all  nonzero  vectors  v  such  that 

^o  i  ou*ii  ro' 

(A-I)v=  0  0  0  ®2  =  0  • 

.0  0  lj|©3j  k 

This  implies  that  v2  =  f3  =  0,  and  u,  is  arbitrary.  Consequently, 

f  1' 

xl(t)  =  e‘  0 

.0, 

is  one  solution  of  x  =  Ax.  Since  A  has  only  one  linearly  independent  eigen¬ 
vector  with  eigenvalue  1,  we  look  for  all  solutions  of  the  equation 

ro  i  oho  i  o]  ro  o  oir»ii  ro' 

(A  -  I)v  =  000  000  v=  000  «2  =  0 

.0  0  1 J  [o  0  lj  [0  0  lJkJ  [o. 

This  implies  that  o3  =  0  and  both  o,  and  v2  are  arbitrary.  Now,  the  vector 

f°l 

v=  1 

.0. 
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satisfies  (A-I)2v  =  0,  but  (A-I)v^O.  (We  could  just  as  well  choose  any 


V  =  v2 
0 


for  which  c^O.)  Hence, 


'0l  0 

x2(t)  =  eAt  1  =  ete('A~i)t  1 

.oj  [0, 

roi  [roi  fo  i  oiro 

=  e'[l+r(A-I)]  1  =e'  1  +t  0  0  0  1 

LoJ  loj  [0  0  lJlO 

roi  rn  \r 

=  e‘  1  +te'  0  —e‘  1 
0  0  0 

is  a  second  linearly  independent  solution. 

(ii)  A  =  2:  We  seek  a  nonzero  vector  v  such  that 

-1  1  0]f»i]  fo' 

(A  — 2I)v=  0  -1  0  u2  =  0  • 

.  0  0  0j[«3J  10. 

This  implies  that  vl  =  v2  =  0  and  i:3  is  arbitrary.  Hence 


x3(t)  =  e2‘  0 


is  a  third  linearly  independent  solution. 


Example  2.  Solve  the  initial-value  problem 

'2  i3i  rr 

x=  0  2  -lx,  x(0)=  2  • 

10  0  2J  1 J 

Solution.  The  characteristic  polynomial  of  the  matrix 

2  1  3' 

A=  0  2  -1 

.0  0  2J 

is  (2— A)3.  Hence  A  =  2  is  an  eigenvalue  of  A  with  multiplicity  three.  The  ei¬ 
genvectors  of  A  satisfy  the  equation 

0  1  3  °i  0 

(A  —  2I)v  =  0  0  -1  ®2  =  0  • 

.o  o  oJKJ  [0. 
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This  implies  that  v2  =  v3  =  0  and  vx  is  arbitrary.  Hence 

r 

xl(t)  =  e2t  0 

is  one  solution  of  x  =  Ax. 

Since  A  has  only  one  linearly  independent  eigenvector  we  look  for  all 
solutions  of  the  equation 

[0  1  3lf0  1  31  [0  0  [O' 

(A  —  2I)2v  =>  0  0  -1  0  0  -1  v=  0  0  0  ^2  =  0. 

.o  o  oJio  o  oj  lo  o  oJKJ  [o. 

This  implies  that  t?3  =  0  and  both  o,  and  v2  are  arbitrary.  Now,  the  vector 

fo] 

v=  1 

.0. 

satisfies  (A-2I)2v  =  0,  but  (A  — 2I)v^0.  Hence 

0  0 
x2(t)  =  eA<  1  =e2,e(A_2I)'  1 

.oj  [0. 

0]  [  fo  1  3l 1 fO 

=  t>2,[l+t(A-2I)]  1  =  el!  I  +t  0  0-1  1 

[Oj  [  [0  0  OjjlO. 

fOl  fill  ft' 

=  e2'  1  +t  0  =  e2'  ] 

.loj  loJJ  lo. 

is  a  second  solution  of  x  =  Ax. 

Since  the  equation  (A  —  2I)2v  =  0  has  only  two  linearly  independent  solu¬ 
tions,  we  look  for  all  solutions  of  the  equation 

3 

0  1  3  0  0  0  vi  0 

(A  —  2I)3v  =  0  0  - 1  v  —  0  0  0  =  0  • 

.0  0  OJ  [0  0  0j[«3J  [0. 

Obviously,  every  vector  v  is  a  solution  of  this  equation.  The  vector 

fol 

v=  0 

.lj 


350 


3.10  Equal  roots 


does  not  satisfy  (A  — 2I)2v  =  0.  Hence 


x3(/)  =  eA' 

[0] 

0 

_e2/e(A-2I)/ 

[0] 

0 

.1. 

.1. 

=  e2t 


I+/(A  — 2I)+y(A-2I): 


f O' 
0 

llj 


O' 

'  3' 

f2 

-r 

3  t-\t2 

II 

0 

4-  t 

-1 

0 

=  e2' 

—  t 

1 

^  J 

oj 

oj 

.  1 

is  a  third  linearly  independent  solution.  Therefore, 


f  'y 

x{t)  —  e2t 

^1 

rr 

0 

+  C2 

f  -y 

t 

l 

+  c3 

3  t-\t2 

—  t 

0 

k.  J 

LoJ 

1 

The  constants  cl5  c2,  and  c3  are  determined  from  the  initial  conditions 


'll 

rii 

[01 

[0] 

2 

L  i  J 

=  Cl 

o  o 

V- _ 

+  C2 

l 

.0. 

+  c3 

0 

.1. 

This  implies  that  cl  =  1,  c2  =  2,  and  c3=  1.  Hence 

1  +5/  —  ^ t 2 

2-t 
1 

For  the  matrix  A  in  Example  2,  /?(A)  =  (2  —  X)3  and  (21  — A)3 =0.  This  is 
not  an  accident.  Every  matrix  A  satisfies  its  own  characteristic  equation. 
This  is  the  content  of  the  following  theorem. 

Theorem  13  (Cayley-Hamilton).  Let p(K)=Po+P\^+  ...+(  —  l^A"  be  the 
characteristic  polynomial  of  A.  Then , 

/?(A)=/>0I+/?1A+...  +(-l)"A"-0. 

Fake  proof.  Setting  X  =  A  in  the  equation  /?(A)  =  det(A  — XI)  gives  p{ A)  = 
det(A— AI)  =  det0=0.  The  fallacy  in  this  proof  is  that  we  cannot  set  A  =  A 
in  the  expression  det(A— AI )  since  we  cannot  subtract  a  matrix  from  the  di¬ 
agonal  elements  of  A.  However,  there  is  a  very  clever  way  to  make  this 
proof  kosher.  Let  C(A)  be  the  classical  adjoint  (see  Section  3.6)  of  the 
matrix  (A— XI).  Then, 


(A-XI)C(X)-/>(X)L  (4) 
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Each  element  of  the  matrix  C(A)  is  a  polynomial  in  A  of  degree  at  most 
( n  —  1).  Therefore,  we  can  write  C(A)  in  the  form 

C(A)  =  C0  +  C1A  +  ...  +C„_1A',~1 

where  C0,...,  are  nXn  matrices.  For  example, 


A+A2 


A2 


Thus,  Equation  (4)  can  be  written  in  the  form 

(A  — AI)[C0  +  C1A+  ...  +C„_1A'?-1]  =/>oI+/?iAI+  ...+(-  \)n\nl.  (5) 


Observe  that  both  sides  of  (5)  are  polynomials  in  A,  whose  coefficients  are 
n  X  n  matrices.  Since  these  two  polynomials  are  equal  for  all  values  of  A, 
their  coefficients  must  agree.  But  if  the  coefficients  of  like  powers  of  A 
agree,  then  we  can  put  in  anything  we  want  for  A  and  still  have  equality.  In 
particular,  set  A=A.  Then, 


P  (A)  =P(^  +P 1A  +•••+(-  1  )"A" 

=  (A— AI)[Co+C,A+  ...  +C/I_1A"_1]  =0.  □ 


Exercises 


In  each  of  Problems  1-4  find  the  general  solution  of  the  given  system  of 
differential  equations. 


2.  x= 


3.  x= 


0 

0  |x  Hint:  Look  at  Example  1  of  text. 
2 


4.  x  = 


f  2  0 
0  2 
0  0 
0  0 


1 

1 

2 

1 


0 

0 

0 

2 


In  each  of  Problems  5-8,  solve  the  given  initial-value  problem 


1  1 


-4  -4  0\ 

6.  x=|  10  9  1 

-4  -3  1/ 


x,  x(0)  = 


2 

1 

1 


8.  x= 


'3 

0 

0 

-> 

0 

rn 

1 

0 

3 

0 

0 

3 

0 

0 

x,  x(0) = 

i 

i 

lo 

0 

2 

3  J 

lij 
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9.  Let 


Show  that 


10.  Let 


Prove  that 


0 

eXlt 


0 

0 


[  0  0  eXnt  J 


1 

X 

0 


1 

0 

10 


t 

1 

0 


t 

1 J 


Hint:  Write  A  in  the  form 


and  observe  that 


A  =  AI  + 


0 

0 

0 


1 

0 

0 


0 

1 

0 


eAt  =  eXtexp 


0  1  0\ 

0  0  1U 

0  0  0/ 


11.  Let  A  be  the  n  X  n  matrix 


X  1  0 
0  X  1 


0 

0 


0  0  0  ...  1 

^0  0  0  ...  X 

and  let  P  be  the  n  X  n  matrix 

0  1  0  ...  O' 

0  0  1  ...  0 


0  0  0  ...  1 
^0  0  0  ...  0^ 
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(a)  Show  that  P”=0. 
(c)  Show  that 


(b)  Show  that  (AI)P=P(AI). 


*2p2  f  n  —  1 

i+/p+l£_ +  ...  +  — - »n~l 


2! 


(n-l)l 


12.  Compute  eAt  if 
2  1 


(a)  A  = 


0  2 
0  0 
0  0 


0 

1 

2 

0 


01 
0 

l1’ 


(b)  A  = 


2 

0 

0 


1 

2 

0 


0 

1 

2 


0  0  0 


01 

0 

0 

2 


(c)  A  = 


0 

0 

0 


1 

2 

0 

0 


0 

0 

2 

0 


0 

0 

0 

2 


13.  (a)  Show  that  eT  at=T  leAT. 
(b)  Given  that 


with 


T1at= 


-1 

-1 

2 


compute  e 


At 


14.  Suppose  that p(A)  =  det(A  — AI)  has  n  distinct  roots  A^.-^A^.  Prove  directly  that 
p(A)  =  (—  ly^A  —  A! I)... (A  —  A„I)  =  0.  Hint:  Write  any  vector  x  in  the  form  x  = 
XjV1  +  . . .  +  xn\n  where  v1, . . . ,  v"  are  n  independent  eigenvectors  of  A  with  eigen¬ 
values  Al9 . . .  respectively,  and  conclude  that  p( A)x  =  0  for  all  vectors  x. 

15.  Suppose  that  A2=aA.  Find  eAt. 

16.  Let 


A  = 


(a)  Show  that  A(A  —  51)  =  0. 

(b)  Find  eAt. 


17.  Let 


1  1  1 
1  1  1 
1  1  1 
1  1  1 
1  1  1 


-( 


0 

-1 


(a)  Show  that  A2  =  —  I. 


1  0 
1  1 

1  1 

1  1 

1  lj 
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(b)  Show  that 

eA /  =  /  cos  t  sin  A 
\  —  sin/  cos  t )' 

In  each  of  Problems  18-20  verify  directly  the  Cayley-Hamilton  Theorem 
for  the  given  matrix  A. 

/  1  -1  -1\  (2  3  1\  /-I  0  1\ 

18.  A  =  2  1  -1  19.  A=  2  3  1  20.  A=  -1  3  0 

\-l  -11/  \2  3  1/  \  2  4  6/ 


3.11  Fundamental  matrix  solutions;  eAt 

If  x1^),  •  •  •  ,x"(/)  are  n  linearly  independent  solutions  of  the  differential 
equation 

x  =  Ax,  (1) 

then  every  solution  x(f)  can  be  written  in  the  form 

x(f)  =  CjX^/)  +  c2x2(t)  +  . . .  +  cnxn{t).  (2) 

Let  X(/)  be  the  matrix  whose  columns  are  x1(/),---,x',(/).  Then,  Equation 
(2)  can  be  written  in  the  concise  form  x(0  =  X(/)c,  where 


Definition.  A  matrix  X(/)  is  called  a  fundamental  matrix  solution  of  (1)  if  its 
columns  form  a  set  of  n  linearly  independent  solutions  of  (1). 


Example  1.  Find  a  fundamental  matrix  solution  of  the  system  of  differen¬ 
tial  equations 


Solution .  We  showed  in  Section  3.8  (see  Example  1)  that 


el 

-r 

4 

,  e3t 

T 

2 

and  e  2t 

-\ 

i 

.  i. 

.1. 

.  i. 

are  three  linearly  independent  solutions  of  (3).  Hence 


r  'v 


—  ef 

e3' 

-e-2‘ 

x(r)= 

2e3t 

e~2' 

e3t 

e~2‘. 

is  a  fundamental  matrix  solution  of  (3). 
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In  this  section  we  will  show  that  the  matrix  eAt  can  be  computed  di¬ 
rectly  from  any  fundamental  matrix  solution  of  (1).  This  is  rather  remark¬ 
able  since  it  does  not  appear  possible  to  sum  the  infinite  series  [I  +  Af  + 
(At)2/2\  +  . ..]  exactly,  for  an  arbitrary  matrix  A.  Specifically,  we  have  the 
following  theorem. 

Theorem  14.  Let  X(t)  be  a  fundamental  matrix  solution  of  the  differential 
equation  x  =  Ax.  Then, 

eAt  =  X(t)X~l(  0).  (4) 

In  other  words,  the  product  of  any  fundamental  matrix  solution  of  ( 1)  with 
its  inverse  at  t  =  0  must  yield  eAt. 

We  prove  Theorem  14  in  three  steps.  First,  we  establish  a  simple  test  to 
determine  whether  a  matrix-valued  function  is  a  fundamental  matrix  solu¬ 
tion  of  (1).  Then,  we  use  this  test  to  show  that  eAt  is  a  fundamental  matrix 
solution  of  (1).  Finally,  we  establish  a  connection  between  any  two  funda¬ 
mental  matrix  solutions  of  (1). 

Lemma  1.  A  matrix  X(t)  is  a  fundamental  matrix  solution  of  (1)  if,  and  only 
if,  X(t)  =  AX(t)  and  detX(0)^0.  (The  derivative  of  a  matrix-valued  func¬ 
tion  X(t)  is  the  matrix  whose  components  are  the  derivatives  of  the  corre¬ 
sponding  components  of  X(t).) 

Proof.  Let  x1(/),...,xw(/)  denote  the  n  columns  of  X(/).  Observe  that 

X(0=(x1(0,...,xn(0) 

and 

AX(t)  =  (Ax'(f), . . . ,  Ax"(t)). 

Hence,  the  n  vector  equations  x'(t)  =  Ax'(/), ...,x”(t)  =  Ax"(/)  are  equiv- 
alent  to  the  single  matrix  equation  X(/)  =  AX(/).  Moreover,  n  solutions 
x1(/),...,x,*(f)  of  (1)  are  linearly  independent  if,  and  only  if,  x1(0),...,xw(0) 
are  linearly  independent  vectors  of  R".  These  vectors,  in  turn,  are  linearly 
independent  if,  and  only  if,  detX^T^O.  Consequently,  X(7)  is  a  fundamen¬ 
tal  matrix  solution  of  (1)  if,  and  only  if,  X(/)  =  AX(7)  and  detX(0)^0.  □ 

Lemma  2.  The  matrix-valued  function  eAt  =  I  +  At  +  A2t2/1\  4-  ...  is  a 
fundamental  matrix  solution  of  (1). 

Proof.  We  showed  in  Section  3.10  that  (d/dt)eAt  =  AeAt.  Hence  eAt  is  a 
solution  of  the  matrix  differential  equation  X(t)  =  AX(t ).  Moreover,  its  de¬ 
terminant,  evaluated  at  /  =  0,  is  one  since  eA0  =  I.  Therefore,  by  Lemma  1, 
eAt  is  a  fundamental  matrix  solution  of  (1).  □ 

Lemma  3.  Let  X(t)  and  Y (t)  be  two  fundamental  matrix  solutions  of  (1). 
Then,  there  exists  a  constant  matrix  C  such  that  Y(t)  =  X(t)C. 
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Proof.  By  definition,  the  columns  x!(/), ...9xn(t)  of  X(t)  and  y1^), ..., 
y n(t)  of  Y(7)  are  linearly  independent  sets  of  solutions  of  (1).  In  particular, 
therefore,  each  column  of  Y(7)  can  be  written  as  a  linear  combination  of 
the  columns  of  X(f);  i.e.,  there  exist  constants  cjv...,cJn  such  that 

yJ(t)  =  c{x\t)  +  cJ2x2(  t)+...  +  cJnxn{t),  j=l,...,n.  (5) 

Let  C  be  the  matrix  (c^c2,  where 

ci 

cJ=  :  . 

• 

c-i 

c n 

V.  J 

Then,  the  n  equations  (5)  are  equivalent  to  the  single  matrix  equation  Y(/) 
=  X(/)C.  □ 

We  are  now  in  a  position  to  prove  Theorem  14. 

Proof  of  theorem  14.  Let  X(/)  be  a  fundamental  matrix  solution  of  (1). 
Then,  by  Lemmas  2  and  3  there  exists  a  constant  matrix  C  such  that 

eXt  =  X(t)  C.  (6) 

Setting  f  =  0  in  (6)  gives  I  =  X(0)C,  which  implies  that  C  =  X-1(0).  Hence, 
eA'  =  X(t)X_1(0).  □ 

Example  1.  Find  eAt  if 

1  1  1 
A=  0  3  2  • 

.0  0  5. 

Solution.  Our  first  step  is  to  find  3  linearly  independent  solutions  of  the 
differential  equation 

1  1  r 

x=  0  3  2  x. 

.0  0  5. 

To  this  end  we  compute 

1-X  1  1  ' 

p  (X)  =  det(A  -  AI)  =  det  0  3-A  2  =(1 -X)(3-X)(5-X). 

.  0  0  5— A, 

Thus,  A  has  3  distinct  eigenvalues  A=  1,  A  =  3,  and  A  =  5. 

(i)  A  =  1 :  Clearly, 

rr 

v‘=  0 

.0. 
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is  an  eigenvector  of  A  with  eigenvalue  one.  Hence 

f 

x1(r)  =  e/  0 

is  one  solution  of  x  =  Ax. 

(ii)  X  =  3:  We  seek  a  nonzero  solution  of  the  equation 

-2  1  1  °i  0 

(A  — 3I)v=  0  0  2  =  o  . 

.  0  0  2 J  L u3  J  lo. 

This  implies  that  t~3  =  0  and  v2  =  2t',.  Hence, 

HI 

v2=  2 

.0. 

is  an  eigenvector  of  A  with  eigenvalue  3.  Consequently, 

t 

x2(t)  =  e3t  2 
0. 

is  a  second  solution  of  x=Ax. 

(iii)  X=5:  We  seek  a  nonzero  solution  of  the  equation 

'-4  1  llf«il  [O' 

(A— 5I)v=  0  -2  2  «2  =  0  • 

.  o  o  oJhJ  [o. 

This  implies  that  v2  =  v3  and  u,  =  v3/2.  Hence, 

r  n 

v3=  2 

.2. 

is  an  eigenvector  of  A  with  eigenvalue  5.  Consequently, 

r  n 

x3(t)  =  eSl  2 

2 . 

is  a  third  solution  of  x=Ax.  These  solutions  are  clearly  linearly  indepen¬ 
dent.  Therefore, 

e'  e3‘  e5‘ 

X(0=  0  2e3'  2e5‘ 

0  0  2c5' 

is  a  fundamental  matrix  solution.  Using  the  methods  of  Section  3.6,  we 
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compute 


Therefore, 


exp 


X‘1(0)  = 


1 

0 

0 


1 

2 

0 


1 

2 

2 


1 

1 

r 

0 

3 

2 

t 

= 

0 

0 

5j 

V. 

e 

0 

0 

el 

0 

10 


-l 


e3' 

2e3t 

0 


1 

0 

0 


e5' 

2e5t 

2e5t 


0 


\ 

2 

1 

2 

0 


—  +  \e3t 


.3/ 


1 

0 

0 


0 

\ 

2 

1 

2 


\ 

2 

2 

2 

0 


0 

1 

2 

1 

2 


- \e3t  +  \e5t 
—  e3t  +  e5t 

„5t 


Exercises 

Compute  eAt  for  A  equal 


1. 


1 

1 

3 


4. 


7.  Find  A  if 


2. 


eAt  = 


0 


1 


5.  I  -1  -3  -1 

1  1-1 


2elt  —  et 

e 


—  6* 


3. 


6. 


et  —  e2t 
e*  —  e2t 


2elt  —  et 

[3e2t  —  3et  3e2t  —  3et  3e*  —  2e2t) 


0 

0 

2 


-1 


In  each  of  Problems  8-11,  determine  whether  the  given  matrix  is  a  funda¬ 
mental  matrix  solution  of  x  =  Ax,  for  some  A;  if  yes,  find  A. 


8. 


2el 


e~{  e*  +  2e~t 

—  e~‘  et  —  2e~t 

e~f  2(e‘  +  e~t) 


—  5  cos  2* 

—  5sin2 1 

3e2t 

—  2(cos2/  +  sin2/) 

2(cos2f  —  sin  2/) 

0 

cos  2 1 

sin  2/ 

e2t 
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1  t  +  1 

/2+f 

e2t 

2e~t 

e31' 

1  2(/+ 1) 

4 12 

11. 

2el 

2e~t 

e3' 

w  1  / + 2 

3  . 

3el 

e~‘ 

2e3' . 

12.  Let  be  the  solution  of  the  initial-value  problem  x  =  Ax,  x(0)  =  e'.  Show 
that  = 

13.  Suppose  that  Y(7)  =  X(/)C,  where  X(7)  and  Y(7)  are  fundamental  matrix  solu¬ 
tions  of  x=Ax,  and  C  is  a  constant  matrix.  Prove  that  detC^O. 

14.  Let  X(/)  be  a  fundamental  matrix  solution  of  (1),  and  C  a  constant  matrix  with 
detC^O.  Show  that  Y{t)=X{t)C  is  again  a  fundamental  matrix  solution  of  (1). 

15.  Let  X(t)  be  a  fundamental  matrix  solution  of  x=Ax.  Prove  that  the  solution 
x(/)  of  the  initial-value  problem  x=Ax,  x(f0)  =  x°  is  x(r)  =  X(/)X_1(/0)x0. 

16.  Let  X(/)  be  a  fundamental  matrix  solution  of  x=Ax.  Prove  that  X(t)X~l(t0)  = 

eA 

17.  Here  is  an  elegant  proof  of  the  identity  eAt+Bt  =  eAteBt  if  AB=BA. 

(a)  Show  that  X{t)  =  eAt+Bt  satisfies  the  initial-value  problem  X  =  (A  +  B)X, 
X(0)  =  I. 

(b)  Show  that  eAtB  =  BeAt  if  AB  =  BA.  {Hint:  AJB  =  BAJ  if  AB  =  BA).  Then, 
conclude  that  (d/  dt)eAteBt  =  (A  +  B)eAteBt . 

(c)  It  follows  immediately  from  Theorem  4,  Section  3.4  that  the  solution  X(/) 
of  the  initial-value  problem  X  =  (A  +  B)X,  X(0)  =  I,  is  unique.  Conclude, 
therefore,  that  eAt+B/  =  eAteBt. 


3.12  The  nonhomogeneous  equation; 
variation  of  parameters 

Consider  now  the  nonhomogeneous  equation  x  =  Ax  +  f(7).  In  this  case,  we 
can  use  our  knowledge  of  the  solutions  of  the  homogeneous  equation 


x  =  Ax  (1) 

to  help  us  find  the  solution  of  the  initial-value  problem 

x  =  Ax  +  f(/),  x(/0)  =  x°.  (2) 

Let  x 1  (/),..., x” (f)  be  /?  linearly  independent  solutions  of  the  homogeneous 
equation  (1).  Since  the  general  solution  of  (1)  is  CjX^OH- ...  +cnxn(t),  it  is 
natural  to  seek  a  solution  of  (2)  of  the  form 

x(/)  =  «i (/)xl(0  +  u2(t)x2(t) 4- ...  +  un(t)xn(t).  (3) 

This  equation  can  be  written  concisely  in  the  form  x(/)  =  X(/)ii(7)  where 
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X(/)  =  (x1(0>--->x"(0)  an<i 


“i(0 


«.(0 


Plugging  this  expression  into  the  differential  equation  x  =  Ax  +  f(f)  gives 

X(  /)u(  t)  +  X(  t  )u(  t ) = AX(  t  )u(  t )  +  f  ( t ) .  (4) 

The  matrix  X(/)  is  a  fundamental  matrix  solution  of  (1).  Hence,  X(/)  = 
A X(t),  and  Equation  (4)  reduces  to 

X(/)u(0=f(0-  (5) 

Recall  that  the  columns  of  X(/)  are  linearly  independent  vectors  of  R"  at 
every  time  /.  Hence  X“!(/)  exists,  and 


u(/)  =  X‘1(/)f(0. 


Integrating  this  expression  between  t0  and  /  gives 


Consequently, 


x 


If  X(0  is  the  fundamental  matrix  solution  eAt ,  then  Equation  (7)  sim¬ 
plifies  considerably.  To  wit,  if  X(t)  =  eAt ,  then  X-1(Y)  =  e~A5.  Hence 


x(t)  =  eAte  A/°x°  ■+■  eAt  f  e  Asf (s)ds 

JtQ 


=  eA ^  '0^x°  +  f  eA(<t  5)f(.s)6fc. 


Example  1.  Solve  the  initial- value  problem 


\l  0 

0] 

r  > 

0 

roi 

• 

x  = 

2  1 
[3  2 

-2 

1. 

x  + 

0 

,e'cos2r. 

,  x(0) = 

i 

.i. 

361 


3  Systems  of  differential  equations 


Solution.  We  first  find  eAt,  where 


A  = 


1 

2 

3 


0 

1 

2 


0 

2 

1 


To  this  end  compute 


det( A  -  AI)  =  det 


1  -A 
2 
3 


0 

1  -A 
2 


0 

-2 
1  -A 


=  (1  — A)(A2  — 2A  +  5). 


Thus  the  eigenvalues  of  A  are 


\  i  a  \  2  ±  V4  —  20  * 

A  =  1  and  A  = - - - =  1±2/. 

(i)  A  =  1:  We  seek  a  nonzero  vector  v  such  that 


(A  — I)v  = 


f0  0 
2  0 
3  2 


0 

2 

0 


'0.' 

0 

v2 

— 

0 

J 

.U3. 

.0. 

This  implies  that  o,  =  t;3  and  v2  =  -3v{/2.  Hence 


v  = 


2 

-3 

2 


is  an  eigenvector  of  A  with  eigenvalue  1.  Consequently, 


\l(t)  =  e‘ 


2 

3 

2 


is  a  solution  of  the  homogeneous  equation  x=Ax. 

(ii)  A=  1  +  2i:  We  seek  a  nonzero  vector  v  such  that 


[A-(l  +  2/')l]v  = 


-2  i  0  0 

2  -2  i  -2 

3  2  -2/ 


V\' 

O' 

v2 

— 

0 

J 

VK 

.OJ 

This  implies  that  =  0  and  u3=  —  /t>2.  Hence, 


v  = 


0 

1 

—  i 


is  an  eigenvector  of  A  with  eigenvalue  1  +2/.  Therefore, 


*(0“ 


0 

1 

•  i 


,(l  +  2/)< 
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is  a  complex- valued  solution  of  x  =  Ax.  Now, 


■  01 

o' 

[01 

1 

e(i+20t=  e'(COs2/  +  zsin2/) 

1 

• 

—  / 

0 

l  “ij 

.0. 

.lj 

[ol 

[ol 

COS  2 1 

1 

+  sin2/ 

0 

loj 

A. 

- 

Consequently, 


+  ie* 


o' 

[o' 

sin2f 

1 

—  cos  2/ 

0 

.0. 

x2(t)  =  e‘ 

'  0  ' 

cos2f 

and  x3(/)  =  e' 

'  0  ' 

sin  2/ 

sin  2/ 

w  — cos2 1 

are  real- valued  solutions  of  x  =  Ax.  The  solutions  x1,  x2,  and  x3  are  linearly 
independent  since  their  values  at  /  =  0  are  clearly  linearly  independent  vec¬ 
tors  of  R3.  Therefore, 


0 

e*  cosh 
ets\n2 1 


0 

e*sm2t 
e'cos2 C 


is  a  fundamental  matrix  solution  of  x  =  Ax.  Computing 


we  see  that 


eAt  — 


'  2  0 

01 

-1 

r 

1 

2 

0 

0 

-(0)  = 

-3  1 

0 

3 

i 

0 

2  0 

-1 

z. 

0 

-1 

J 

J 

.1 

f  2e* 

0 

0 

1 

0 

3el  e'cos2  / 
2ef  e*sm2 1 


efsm2t 
c^cos2 t 


3 

2 


1 

0 


0 

0 

1 


=  e 


1  0 
—  §  +  §  cos  2/ +  sin  2/  cos  2/ 

1  +  f  sin  2  /  -  cos  2 1  sin  2  / 


0 

sin  2/ 
cos  2 1 
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Consequently, 

f° 

x(/)  =  eA'  1 


1 

- 1  +  \cos2s  —  sin2.s 
1  —  2  sin  2^  —  cos2.y 


0 

cos  2s 
—  sin  2s 


0 

sin  2s 
cos  2s 


ds 


[es cos 2s ) 


0 

c 

0 

=  el 

cos2/  —  sin2/ 

+  eAt  1 

sin  2s  cos  2s 

cos2/  +  sin2/ 

^  j 

Jo 

cos2  2s 

ds 


0 

cos2/  —  sin2/ 

+  eA' 

0 

(1  —  cos4/)/8 

cos2/  +  sin2/ 

v.  J 

//2  +  (sin4/)/8 

> 

cos2/  — sin2/ 
cos2/  +  sin2/ 


r 


- he ' 


/sin  2/  cos  2/  —  cos  4/  cos  2/  —  sin  4/  sin  2/ 

2  8 
/cos 2/  ,  sin 4/cos 2/  —  sin2/cos4/  +  sin2/ 
~^~+  8 


cos2/-(l  +  ^/)sin2/ 
(l  +  ^/)cos2/  +  |sin2/ 


As  Example  1  indicates,  the  method  of  variation  of  parameters  is  often 
quite  tedious  and  laborious.  One  way  of  avoiding  many  of  these  calcula¬ 
tions  is  to  “guess”  a  particular  solution  i//(/)  of  the  nonhomogeneous  equa¬ 
tion  and  then  to  observe  (see  Exercise  9)  that  every  solution  x(/)  of  the 
nonhomogeneous  equation  must  be  of  the  form  ^(/)  +  ^(/)  where  *s  a 
solution  of  the  homogeneous  equation. 


Example  2.  Find  all  solutions  of  the  differential  equation 


i 

0 

O' 

1 

• 

x  = 

2 

1 

-2 

x  + 

0 

3 

2 

1J 

LoJ 
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Solution.  Let 


A= 


0 

1 

2 


We  “guess”  a  particular  solution  t p(t)  of  the  form  ip(t)  =  becl.  Plugging  this 
expression  into  (9)  gives 


or 


This  implies  that 


cbec‘  =  Abec'  + 


(A  -  cl)b  = 


0 

OJ 


1 


-1 

1  —  c 


2(c~4) 

4  +  (l  —  c)2 
1  +  3c 
4  +  (l  —  c)2 


Hence,  every  solution  x(t)  of  (9)  is  of  the  form 


\(t)  =  e‘ 

'  2 
-3 

+  c2 

'  0  ' 

cos  2/ 

+  c3 

'  0 
-sin  2/ 

.  2. 

sin  2/ 

^  j 

cos  2  ^ 

1 


e 


ct 


1  —  C 


2(c~4) 

4  +  (l  —  c)2 
1  +3c 
4+(l  — c)2 
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Remark.  We  run  into  trouble  when  c=  1  because  one  is  an  eigenvalue  of 
the  matrix 

10  O' 

2  1-2. 

.3  2  1. 

More  generally,  the  differential  equation  x  =  Ax  +  vec/  may  not  have  a 
solution  of  the  form  bect  if  c  is  an  eigenvalue  of  A.  In  this  case  we  have  to 
guess  a  particular  solution  of  the  form 

ip(/)  =  ec'[b0  +  lv+  ...  +b  k_ltk~l] 
for  some  appropriate  integer  k.  (See  Exercises  10-18). 

Exercises 

In  each  of  Problems  1-6  use  the  method  of  variation  of  parameters  to 
solve  the  given  initial-value  problem. 

Li=(-2  -2)X+(4e0°S')’  X(0)  =  (S) 

2‘  x=(i  ~-X\h  x(o)=(!) 

3’  X=(l  X(°)=(o) 

4. *=(-i  o)x+(/‘ (/>)’  x(0)=(o) 

5.  x=|o  2  ojx+|oje2',  x(0)  =  |lj 

/-I  -1  — 2\  /1\  /0\ 

6.  x=l  1  1  1  )x+  0  \e‘,  x(0)=l  0  ) 

V  2  1  3/  \o/  Vo/ 

7.  Consider  the  /ith-order  scalar  differential  equation 

L\.y\=^jpr+ai~^  +  ---+ar,y=f{t)-  (*) 

Let  v(t)  be  the  solution  of  L[y]  =  0  which  satisfies  the  initial  conditions 
>>(0)=  ...  =^('I-2)(0)  =  0, 1)(0)=  1.  Show  that 

y(t)=  f'v(t-s)f(s)ds 
Jo 

is  the  solution  of  (*)  which  satisfies  the  initial  conditions y (0)  =  . . .  =  y^n~  !)(0)  = 
0.  Hint :  Convert  (*)  to  a  system  of  n  first-order  equations  of  the  form  x  =  Ax, 


366 


3.12  The  nonhomogeneous  equation;  variation  of  parameters 


and  show  that 


®(0 

«'(o 


is  the  wth  column  of  eAt. 


8.  Find  the  solution  of  the  initial-value  problem 


d\  dy 

— -  +  -j-  —  sect  tan/, 
dt 3  dt 


y(  0)=/(0)=/'(0)  =  0. 


9.  (a)  Let  xp(t)  be  a  solution  of  the  nonhomogeneous  equation  x= Ax+f(7)  and 
let  be  a  solution  of  the  homogeneous  equation  x=Ax.  Show  that 
+  x p(t)  is  a  solution  of  the  nonhomogeneous  equation. 

(b)  Let  ^](/)  and  xp2(t)  be  two  solutions  of  the  nonhomogeneous  equation. 
Show  that  xf/^t)  —  xp2(t)  is  a  solution  of  the  homogeneous  equation. 

(c)  Let  xj/(t)  be  a  particular  solution  of  the  nonhomogeneous  equation.  Show 
that  any  other  solution  y(/)  must  be  of  the  form  y(/)=^(/)  +  ^(/)  where 
^(f)  is  a  solution  of  the  homogeneous  equation. 

In  each  of  Problems  10-14  use  the  method  of  judicious  guessing  to  find  a 
particular  solution  of  the  given  differential  equation. 

».x-(2  _})x+(!p'  »•*-(!  “])x  +  ("2f) 

/  1  3  2\  /  sin  A 

11  *v;  -in!) 

ll  2  "3\  /  l\ 

13.  x=M  1  2)X+( 

( - 1  -i  0\  /  1  \ 

14.  x  =  I  0  -4  -1  x+  t 

\  0  5  0/  W! 

15.  Consider  the  differential  equation 

x  =  Ax  +  veX/  (*) 

where  v  is  an  eigenvector  of  A  with  eigenvalue  A.  Suppose  moreover,  that  A  has 
n  linearly  independent  eigenvectors  v1, v2, . . .,vw,  with  distinct  eigenvalues 
respectively. 

(a)  Show  that  (*)  has  no  solution  xp(t)  of  the  form  ^(0  =  a^X/.  Hint:  Write  a  = 
axvl  +  ...  +a„vn. 

(b)  Show  that  (*)  has  a  solution  xf/(t)  of  the  form 

xj/(t)  =  2LeXt +  bteXt. 
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Hint:  Show  that  b  is  an  eigenvector  of  A  with  eigenvalue  A  and  choose  it  so 
that  we  can  solve  the  equation 

(A  -  AI)a  =  b  -  v. 

In  each  of  Problems  16-18,  find  a  particular  solution  of  the  given  dif¬ 
ferential  equation  of  the  form  \f/(t)  =  eXt  (a +b/)- 

/I  1  -1\  / 1 \ 

16.  x  =  I  2  3  -4  x+  2  e2t 

\4  1  -4  \  1  / 

17.  x=|  1  3  ljx+|-lje3' 

/ 3  2  4\  /  2  \ 

18.  x=  2  0  2  x+  1  e8' 

\ 4  2  3/  \2 


3.13  Solving  systems  by  Laplace  transforms 

The  method  of  Laplace  transforms  introduced  in  Chapter  2  can  also  be 
used  to  solve  the  initial-value  problem 


Let 


and 


x  =  Ax  -l-  f  ( r),  x(0)  =  x°. 


Taking  Laplace  transforms  of  both  sides  of  (1)  gives 

£{x(f)}  =  £{Ax(/)  +  f(/)}  =A£{x(/)}  -I-  £{f(0} 
=  AX(s)  +  F(s), 
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and  from  Lemma  3  of  Section  2.9, 


Hence, 


or 


sX„(s)-xn(0) 

V  J 

=  —  x°. 


e{x(/)}  = 


£{4(0} 


£{4(0} 


jX(j)-x°=AX(5)  +  F(5) 


(jI — A)X(j)  =  x° + F(j), 


1  0 
0  1 


o' 

0 


0  0  ...  1 


Equation  (2)  is  a  system  of  n  simultaneous  equations  for  Xl(s),...9Xn(s)9 
and  it  can  be  solved  in  a  variety  of  ways.  (One  way,  in  particular,  is  to  mul¬ 
tiply  both  sides  of  (2)  by  (^1— A)-1.)  Once  we  know  Xx(s)9...9Xn(s)  we  can 
find  xx{t)9...9xn{t)  by  inverting  these  Laplace  transforms. 


Example  1.  Solve  the  initial-value  problem 


*■(!  tM!h  x(0)-(?)- 


(3) 


Solution .  Taking  Laplace  transforms  of  both  sides  of  the  differential  equa¬ 
tion  gives 


or 


(j-l)X,  {s)-4X2{s)=2  + 
X ,  (j)  +  (j-1)*2(j)-1  + 


1 


s—  1 
1 

j-1  ' 


The  solution  of  these  equations  is 


X,(J)=-^r+  1 


1 


*-3  s2-l ’ 


s  —  3  (s—  !)(.?  +  l)(s  —  3) 


Now, 


— r-  =  £{2e3' },  and 


j-3 


s2—  1 


=  £{sinh/}  =  £ 


e  —  e 


369 


3  Systems  of  differential  equations 


Hence, 

x,(/)=2e3'  + 

To  invert  X2(s),  we  use  partial  fractions.  Let 

_ s _ _  A  ,  B  ,  C 

(s  —  l)(s  +  l)(s  —  3)  s  —  1  5  +  1  5  —  3 

This  implies  that 

A  (5  + 1)(5  —  3)  +  B  (5  —  1)(5  —  3)  +  C  (5  —  1)(5  + 1)  =  5. 

Setting  5  =  1,  —1,  and  3  respectively  in  (4)  gives  A  =  —  B  = 
C=§.  Consequently, 


1  1 

4  5—1 


8 


1  1  11  1 
85  +  I  8  5  —  3  , 


Exercises 

Find  the  solution  of  each  of  the  following  initial-value  problems. 

!-*=(_2  ~l)*>  x(°)  =  (?j 
2.  x=(*  ziy>  x<°>=(i) 

3-i=(l  —  2 )X ( 3er )’  X(°H?) 

4*i=(i  !)x+(-])*'  x(0)=(o) 

5.x=(3  x(0) “ ( J ) 

‘Hi  -iHZ't)’  *<»=(“ i) 

7**  =  (-2  -2)X+(4eo°S<)’  X(0)=(l) 

»•*-(-!  iHO  x<0)-(2) 

;>)*♦( ' *«»-(S) 
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11. 

12. 

13. 

14. 


(I  2  —  3  \  / 1  \ 

x=(  1  1  2  x,  x(0)=  0 

\l  -1  4/  \0/ 


3 

0 

0 

0 

rn 

15.  x  = 

1 

0 

3 

0 

0 

3 

0 

0 

x, 

x(0)  = 

i 

i 

lo 

0 

2 

3  J 

L  i  J 
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4  Qualitative  theory  of 
differential  equations 


4.1  Introduction 

In  this  chapter  we  consider  the  differential  equation 

x  =  f(r,x)  (1) 

where 

*i(0 

x=  : 

MO, 

and 


f(*,x)  = 


is  a  nonlinear  function  of  xx,..,,xn.  Unfortunately,  there  are  no  known 
methods  of  solving  Equation  (1).  This,  of  course,  is  very  disappointing. 
However,  it  is  not  necessary,  in  most  applications,  to  find  the  solutions  of 
(1)  explicitly.  For  example,  let  xx(t)  and  x2(t)  denote  the  populations,  at 
time  t ,  of  two  species  competing  amongst  themselves  for  the  limited  food 
and  living  space  in  their  microcosm.  Suppose,  moreover,  that  the  rates  of 
growth  of  xx{t)  and  x2(t)  are  governed  by  the  differential  equation  (1).  In 
this  case,  we  are  not  really  interested  in  the  values  of  and  x2{t)  at  ev¬ 
ery  time  t.  Rather,  we  are  interested  in  the  qualitative  properties  of  xx(t) 
and  x2(t).  Specically,  we  wish  to  answer  the  following  questions. 
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1.  Do  there  exist  values  and  |2  at  which  the  two  species  coexist 
together  in  a  steady  state?  That  is  to  say,  are  there  numbers  £j,  i~2  such  that 
*,(/)=£!,  x2{t)=^2  is  a  solution  of  (1)?  Such  values  £j,!2>  if  they  exist,  are 
called  equilibrium  points  of  (1). 

2.  Suppose  that  the  two  species  are  coexisting  in  equilibrium.  Suddenly, 
we  add  a  few  members  of  species  1  to  the  microcosm.  Will  xx(t)  and  x2(t) 
remain  close  to  their  equilibrium  values  for  all  future  time?  Or  perhaps  the 
extra  few  members  give  species  1  a  large  advantage  and  it  will  proceed  to 
annihilate  species  2. 

3.  Suppose  that  xx  and  x2  have  arbitrary  values  at  /  =  0.  What  happens 
as  /  approaches  infinity?  Will  one  species  ultimately  emerge  victorious,  or 
will  the  struggle  for  existence  end  in  a  draw? 

More  generally,  we  are  interested  in  determining  the  following  proper¬ 
ties  of  solutions  of  (1). 

1 .  Do  there  exist  equilibrium  values 


for  which  x(t)=x°  is  a  solution  of  (1)? 

2.  Let  ^  (/)  be  a  solution  of  (1).  Suppose  that  ip(t)  is  a  second  solution 
with  ^/(O)  very  close  to  ^  (0);  that  is,  ^(0)  is  very  close  to  <f>j(0),j=  1 

Will  i p(t)  remain  close  to  <}>(t)  for  all  future  time,  or  will  \p(t)  diverge  from 
^  (0  as  t  approaches  infinity?  This  question  is  often  referred  to  as  the  prob¬ 
lem  of  stability.  It  is  the  most  fundamental  problem  in  the  qualitative  the¬ 
ory  of  differential  equations,  and  has  occupied  the  attention  of  many 
mathematicians  for  the  past  hundred  years. 

3.  What  happens  to  solutions  x(t)  of  (1)  as  t  approaches  infinity?  Do  all 
solutions  approach  equilibrium  values?  If  they  don’t  approach  equilibrium 
values,  do  they  at  least  approach  a  periodic  solution? 

This  chapter  is  devoted  to  answering  these  three  questions.  Remarkably, 
we  can  often  give  satisfactory  answers  to  these  questions,  even  though  we 
cannot  solve  Equation  (1)  explicitly.  Indeed,  the  first  question  can  be 
answered  immediately.  Observe  that  x(/)  is  identically  zero  if  x(t)  =  x°. 
Hence,  x°  is  an  equilibrium  value  of  (1),  if,  and  only  if, 


f(;,x°)=o.  (2) 

Example  1.  Find  all  equilibrium  values  of  the  system  of  differential  equa¬ 
tions 

dx  j  dx2 

— ; —  =  1  —  Xy,  —r-  =X7  +  X?. 

dt  2  dt  1  2 
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Solution. 


x 


X 

X 


0 

1 

0 

2 


is  an  equilibrium  value  if,  and  only  if,  1  — ^2  =  0  and  (x?)3  +  *2=0.  This 
implies  that  x2  =  1  and  jc®  =  —  1 .  Hence  |  “  j  j  is  the  only  equilibrium  value 
of  this  system. 


Example  2.  Find  all  equilibrium  solutions  of  the  system 

^={x-\){y-\),  ~^={x+\)(y+\). 

Solution. 


\yoj 

is  an  equilibrium  value  of  this  system  if,  and  only  if,  ( x0 —  lX^o-  and 
(x0  +  1X^0+  1)  =  0.  The  first  equation  is  satisfied  if  either  x0  or  y0  is  1, 
while  the  second  equation  is  satisfied  if  either  xQ  or>>0  is  —  1.  Hence,  x=  1, 
y  =  -  1  and  x  =  -  1,  y  =  1  are  the  equilibrium  solutions  of  this  system. 

The  question  of  stability  is  of  paramount  importance  in  all  physical 
applications,  since  we  can  never  measure  initial  conditions  exactly.  For 
example,  consider  the  case  of  a  particle  of  mass  one  kgm  attached  to  an 
elastic  spring  of  force  constant  1  N/m  which  is  moving  in  a  frictionless 
medium.  In  addition,  an  external  force  F(/)  =  cos2/  N  is  acting  on  the 
particle.  Let  y(t)  denote  the  position  of  the  particle  relative  to  its  equi¬ 
librium  position.  Then  ( d2y/dt2)+  y  =  cos2 1.  We  convert  this  second-order 
equation  into  a  system  of  two  first-order  equations  by  setting  =  y, 
x2  =  y'.  Then, 


dx  j 

~dT  =  x» 


dx2 

dt 


=  —  xx  +  cos2l 


The  functions yx(t)  =  sin/  and^2(0  =  cos/  are  two  independent  solutions  of 
the  homogeneous  equation y "  +y  =  0.  Moreover,  y  =  —  |cos2 /  is  a  particu¬ 
lar  solution  of  the  nonhomogeneous  equation.  Therefore,  every  solution 


of  (3)  is  of  the  form 


*i  (0 

x2{t) 


— jcos2f 
f  sin2f 
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At  time  /  =  0  we  measure  the  position  and  velocity  of  the  particle  and  ob¬ 
tain  y(0)=  l,j>'(0)  =  0.  This  implies  that  cx  =0  and  c2  =  f .  Consequently,  the 
position  and  velocity  of  the  particle  for  all  future  time  are  given  by  the 
equation 

\y(o)  wo/ 

However,  suppose  that  our  measurements  permit  an  error  of  magnitude 
10  “4.  Will  the  position  and  velocity  of  the  particle  remain  close  to  the  val¬ 
ues  predicted  by  (5)?  The  answer  to  this  question  had  better  be  yes,  for 
otherwise,  Newtonian  mechanics  would  be  of  no  practical  value  to  us.  For¬ 
tunately,  it  is  quite  easy  to  show,  in  this  case,  that  the  position  and  velocity 
of  the  particle  remain  very  close  to  the  values  predicted  by  (5).  Lety(/)  and 
y\t)  denote  the  true  values  of  y(/)  andy'(0  respectively.  Clearly, 

y(t)—y  (/)  =  (|  —  c2)cos/-c1sin/ 
y\  t)—y\t)  =  —  cxcost  —  —  c2)sinf 

where  c{  and  c2  are  two  constants  satisfying 

-  10 "4<  Cj  <  10‘4,  f  -  10”4<  c2<f  +  10~4. 

We  can  rewrite  these  equations  in  the  form 

[2  -|  1  /2  C| 

ci  +  (j_c2)  cos  (/-«,),  tan  5,  =  — —j 

-*  ^2  3 


|cost  — jcos2t 
|sin/  +  3sin2t 


(5) 


21 ‘/2 


y'(t)-yV)=  c?+(|-c2)  cos(f  —  S2),  tanfi2  = 


*  —  c 
3  C2 


Hence,  both7(/)—  y(t)  2lti& y\t)—yf(t)  are  bounded  in  absolute  value  by 
[c?  +  (f -c2)2]1/2.  This  quantity  is  at  most  V2  10-4.  Therefore,  the  true 
values  of  y{t)  andy'(0  are  indeed  close  to  the  values  predicted  by  (5). 

As  a  second  example  of  the  concept  of  stability,  consider  the  case  of  a 
particle  of  mass  m  which  is  supported  by  a  wire,  or  inelastic  string,  of 
length  /  and  of  negligible  mass.  The  wire  is  always  straight,  and  the  system 
is  free  to  vibrate  in  a  vertical  plane.  This  configuration  is  usually  referred 
to  as  a  simple  pendulum.  The  equation  of  motion  of  the  pendulum  is 


+ 


8 

l 


sinj>  =  0 


where  y  is  the  angle  which  the  wire  makes  with  the  vertical  line  AO  (see 
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Figure  1).  Setting  xx  —y  and  x2  =  dy  /  dt  v/e  see  that 


dxx 

-dT  =  x» 


dx2 

dt 


g  . 

— -  smxx. 


The  system  of  equations  (6)  has  equilibrium  solutions  xx  =0,  x2  =  0,  and  xx 
=  7T,  x2  =  0.  (If  the  pendulum  is  suspended  in  the  upright  position  y  =  it 
with  zero  velocity,  then  it  will  remain  in  this  upright  position  for  all  future 
time.)  These  two  equilibrium  solutions  have  very  different  properties.  If  we 
disturb  the  pendulum  slightly  from  the  equilibrium  position  x^O,  x2  =  0 , 
by  either  displacing  it  slightly,  or  giving  it  a  small  velocity,  then  it  will  ex¬ 
ecute  small  oscillations  about  xx  =0.  On  the  other  hand,  if  we  disturb  the 
pendulum  slightly  from  the  equilibrium  position  xx  =  tt9  x2  =  0,  then  it  will 
either  execute  very  large  oscillations  about  x,=0,  or  it  will  rotate  around 
and  around  ad  infinitum.  Thus,  the  slightest  disturbance  causes  the 
pendulum  to  deviate  drastically  from  its  equilibrium  position  x,  =  7r,  x2  =  0. 
Intuitively,  we  would  say  that  the  equilibrium  value  xx  =0,  x2=0  of  (6)  is 
stable,  while  the  equilibrium  value  xx  =  n r,  x2  =  0  of  (6)  is  unstable.  This 
concept  will  be  made  precise  in  Section  4.2. 

The  question  of  stability  is  usually  very  difficult  to  resolve,  because  we 
cannot  solve  (1)  explicitly.  The  only  case  which  is  manageable  is  when 
f(/,x)  does  not  depend  explicitly  on  t\  that  is,  f  is  a  function  of  x  alone. 
Such  differential  equations  are  called  autonomous.  And  even  for  autono¬ 
mous  differential  equations,  there  are  only  two  instances,  generally,  where 
we  can  completely  resolve  the  stability  question.  The  first  case  is  when  f(x) 
=  Ax  and  it  will  be  treated  in  the  next  section.  The  second  case  is  when  we 
are  only  interested  in  the  stability  of  an  equilibrium  solution  of  x  =  f(x). 
This  case  will  be  treated  in  Section  4.3. 

Question  3  is  extremely  important  in  many  applications  since  an  answer 
to  this  question  is  a  prediction  concerning  the  long  time  evolution  of  the 
system  under  consideration.  We  answer  this  question,  when  possible,  in 
Sections  4.6 -4.8  and  apply  our  results  to  some  extremely  important  ap¬ 
plications  in  Sections  4.9-4.12. 
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Exercises 

In  each  of  Problems  1-8,  find  all  equilibrium  values  of  the  given  system  of 
differential  equations. 


^jt  —  x  —  x2  —  2  xy 
dt 

2. 

jL  =  -Pxy  +  ,i 

H 

i 

to 

1 

U> 

P 

1 

&> 

4 

II 

dx  r 

a  =ax~bxy 

4. 

<N 

1 

* 

1 

II 

^  | 

dy  . 

-  =  -cy  +  dxy 

(N 

H 

1 

1 

II 

dt  x 

dz  .  ,2 

dt 

dx  2 

dt  =Xy  X 

6. 

dx 

dt =cosy 

dy 

dt  ~xsmvy 

^  =sinx-  1 
dt 

X 

1 

1 

1 

II 

^3  <3 

8. 

dx  2 

Tt=x~y 

dt=x2+y(e*-\) 

dy  2 

-r  =  x—y 
dt  7 

dz  .  . 

=jc  +  sinz 
dt 

dz  z 

-7-  =  e  —  x 

dt 

9.  Consider  the  system  of  differential  equations 

dx  .  dy 

—j-=ax  +  by,  —r-  = 


cx  +  dy. 


dt  7  dt  7 

(i)  Show  that  x  =  0,  y  =  0  is  the  only  equilibrium  point  of  (*)  if  ad 

(ii)  Show  that  (*)  has  a  line  of  equilibrium  points  if  ad—bc  =  0. 

10.  Let  jc  =  *(/),  y  =y(/)  be  the  solution  of  the  initial-value  problem 


—  be  7^0. 


=  -x-y, 


=  2  x—y,  x(0)=y(0)  =  1. 


Suppose  that  we  make  an  error  of  magnitude  10  4  in  measuring  x(0)  andy(0). 
What  is  the  largest  error  we  make  in  evaluating  x(t),  y(t)  for  0  <  t  <  oo? 

11.  (a)  Verify  that 


x  = 


0 

0/ 


is  the  solution  of  the  initial-value  problem 


x  = 


1  1  M 

2  1  -1  x- 

-3  2  4/ 


h 


x(0)  =  0 

Vo 
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(b)  Let  x=\p(t)  be  the  solution  of  the  above  differential  equation  which  satis¬ 
fies  the  initial  condition 


x(0)  =  x°=tM  0  )• 

Vo/ 

Show  that  each  component  of  \p(t)  approaches  infinity,  in  absolute  value, 
as  t — >oo. 


4.2  Stability  of  linear  systems 

In  this  section  we  consider  the  stability  question  for  solutions  of  autono¬ 
mous  differential  equations.  Let  x=<f>(7)  be  a  solution  of  the  differential 
equation 

x  =  f(x).  (1) 

We  are  interested  in  determining  whether  ${t)  is  stable  or  unstable.  That  is 
to  say,  we  seek  to  determine  whether  every  solution  ^(7)  of  (1)  which  starts 
sufficiently  close  to  <f>(7)  at  f  =  0  must  remain  close  to  for  all  future 
time  t  >  0.  We  begin  with  the  following  formal  definition  of  stability. 

Definition.  The  solution  x  =  <j>(t)  of  (1)  is  stable  if  every  solution  \f/(t)  of  (1) 
which  starts  sufficiently  close  to  $(t)  at  t  =  0  must  remain  close  to 
for  all  future  time  t.  The  solution  ^>(7)  is  unstable  if  there  exists  at  least 
one  solution  \f/(t)  of  (1)  which  starts  near  <£(/)  at  /  =  0  but  which  does 
not  remain  close  to  <j>(t)  for  all  future  time.  More  precisely,  the  solution 
is  stable  if  for  every  e>0  there  exists  5  =  5(e)  such  that 

if  1^(0) -<*>,(0)|<  5(c),  j=l,...,n 

for  every  solution  ^(7)  of  (1). 

The  stability  question  can  be  completely  resolved  for  each  solution  of 
the  linear  differential  equation 

x  =  Ax.  (2) 

This  is  not  surprising,  of  course,  since  we  can  solve  Equation  (2)  exactly. 
We  have  the  following  important  theorem. 

Theorem  1.  (a)  Every  solution  x  =  $(t)  of  (2)  is  stable  if  all  the  eigenvalues 
of  A  have  negative  real  part. 

(b)  Every  solution  x=<£(7)  of  (2)  is  unstable  if  at  least  one  eigenvalue  of  A 
has  positive  real  part. 

(c)  Suppose  that  all  the  eigenvalues  of  A  have  real  part  <  0  and  \x  = 

A,  =  iot  have  zero  real  part.  Let  A j  =  iOj  have  multiplicity  kj.  This 
means  that  the  characteristic  polynomial  of  A  can  be  factored  into  the  form 

P  (\) =  (A  —  iox)kl ...  (A  —  io^'qfK) 
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where  all  the  roots  of  q( X)  have  negative  real  part .  Then ,  every  solution 
x  =  <p(t)  of  (1)  is  stable  if  A  has  kj  linearly  independent  eigenvectors  for 
each  eigenvalue  Xj  =  iOj.  Otherwise ,  every  solution  is  unstable. 

Our  first  step  in  proving  Theorem  1  is  to  show  that  every  solution  <f>(t)  is 
stable  if  the  equilibrium  solution  x(/)=0  is  stable,  and  every  solution  <f>(t) 
is  unstable  if  x(/)=0  is  unstable.  To  this  end,  let  \f/(t)  be  any  solution  of 
(2).  Observe  that  z(t)=4>(0~1f/(0  is  again  a  solution  of  (2).  Therefore,  if 
the  equilibrium  solution  x(/)=0  is  stable,  then  z(t)=<j>(t)-\p(t)  will  always 
remain  small  if  z(0)  =  <f>(0)  —  \f/(0)  is  sufficiently  small.  Consequently,  every 
solution  <f>(t)  of  (2)  is  stable.  On  the  other  hand  suppose  that  x(/)=0  is  un¬ 
stable.  Then,  there  exists  a  solution  x  =  h(7)  which  is  very  small  initially, 
but  which  becomes  large  as  t  approaches  infinity.  The  function  if'(t)  =  4>(0 
+  h(/)  is  clearly  a  solution  of  (2).  Moreover,  $(t)  is  close  to  4>(t)  initially, 
but  diverges  from  4 >(t )  as  t  increases.  Therefore,  every  solution  x  =  4>(t)  of 
(2)  is  unstable. 

Our  next  step  in  proving  Theorem  1  is  to  reduce  the  problem  of  showing 
that  n  quantities  \pj(t),j  =  are  small  to  the  much  simpler  problem  of 

showing  that  only  one  quantity  is  small.  This  is  accomplished  by  introduc¬ 
ing  the  concept  of  length,  or  magnitude,  of  a  vector. 

Definition.  Let 


x  = 


be  a  vector  with  n  components.  The  numbers  xv...,xn  may  be  real  or 
complex.  We  define  the  length  of  x,  denoted  by  ||x||  as 


For  example,  if 


|x||=max{|x1|,|x2|,...,|jcn|}. 


x  = 


f 
2  , 


then  ||x||  =  3  and  if 


x  = 


1+2/ 

2 

-1 


then  ||x||  =  V5  . 

The  concept  of  the  length,  or  magnitude  of  a  vector  corresponds  to  the 
concept  of  the  length,  or  magnitude  of  a  number.  Observe  that  ||x||  >0  for 
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any  vector  x  and  ||x||  =0  only  if  x  =  0.  Second,  observe  that 

||Xx||  =  max{|Xx1|,...,|Ax„|}  =  |X|max{|x1|,...,|x„|}  =  |X|||x||. 

Finally,  observe  that 

llx+y||  =  max{|x1+>'1|,...,|x)1+>'n|}  <max{|x,|  +  |>',|,...,|xn|  +  |7n|} 

<max{|x,|,...,|x„|}  +max{|>'1|,...,|^„|}  =  ||x||  +  ||y||. 

Thus,  our  definition  really  captures  the  meaning  of  length. 

In  Section  4.7  we  give  a  simple  geometric  proof  of  Theorem  1  for  the 
case  n  =  2.  The  following  proof  is  valid  for  arbitrary  n. 

Proof  of  theorem  1.  (a)  Every  solution  x  =  ^(/)  of  x  =  Ax  is  of  the  form 
i/^/)  =  eAty(0).  Let  be  the  ij  element  of  the  matrix  eAt ,  and  let 
be  the  components  of  ^(0).  Then,  the  ith  component  of  \p(t)  is 

j- 1 

Suppose  that  all  the  eigenvalues  of  A  have  negative  real  part.  Let  —  oq  be 
the  largest  of  the  real  parts  of  the  eigenvalues  of  A.  It  is  a  simple  matter  to 
show  (see  Exercise  17)  that  for  every  number  —  a ,  with  —  ax  <  —  a  <0,  we 
can  find  a  number  K  such  that  1^(01  <  Ke~at ,  t  >  0.  Consequently, 

l^rCOI  <  2  ** -“VI  2  \tj\ 

j= 1  7-1 

for  some  positive  constants  K  and  a.  Now,  |i/^|  <  ||^(0)||.  Hence, 

II ’HO  II  ==max{|^/1(/)|,.. ., |^(Q|}  <  nKe~at\\\l/{0)\\. 

Let  e>0  be  given.  Choose  8(e)  =  e/nK.  Then,  ||^(/)ll  < e  if  ll^(0)||  <8(e) 
and  t>  0,  since 

11^(011  <  nKe~at\W{0)\\  < nKe/nK=  e. 

Consequently,  the  equilibrium  solution  x(/)=0  is  stable. 

(b)  Let  X  be  an  eigenvalue  of  A  with  positive  real  part  and  let  v  be  an 
eigenvector  of  A  with  eigenvalue  X.  Then,  $(i)  =  ceXty  is  a  solution  of  x  = 
Ax  for  any  constant  c.  If  X  is  real  then  v  is  also  real  and  ||^(/)|| =  \c\eXt llvll- 
Clearly,  ||^(/)||  approaches  infinity  as  t  approaches  infinity,  for  any  choice 
of  Ct^O,  no  matter  how  small.  Therefore,  x(/)=0  is  unstable.  If  X=a  +  i/3 
is  complex,  then  v  =  vl  +  /v2  is  also  complex.  In  this  case 

e(a  +  I0)t(y\  _|_  /y2)  =  e<xt  ^cos pi  _|_  j  sjn^)(y1  -|-  /y2) 

=  eat  [  (v1  cos  fit  —  v2  sin/?/)  +  /  (v1  sin  jit  4-  v2  cos  /?/)  ] 
is  a  complex-valued  solution  of  (2).  Therefore 

\p 1  ( /)  =  ce  at  (v1  cos  fit  —  v2  sin  fit) 

is  a  real-valued  solution  of  (2),  for  any  choice  of  constant  c.  Clearly, 
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||^1(/)||  is  unbounded  as  t  approaches  infinity  if  c  and  either  v1  or  v2  is 
nonzero.  Thus,  x(7)=0  is  unstable. 

(c)  If  A  has  kj  linearly  independent  eigenvectors  for  each  eigenvalue  A 1  = 
ioj  of  multiplicity  kp  then  we  can  find  a  constant  K  such  that  l(eA%l  <  K 
(see  Exercise  18).  There,  ||^(0II  <  wA'||^(0)||  for  every  solution  ^(t)  of  (2). 
It  now  follows  immediately  from  the  proof  of  (a)  that  x(f)=0  is  stable. 

On  the  other  hand,  if  A  has  fewer  than  kj  linearly  independent  eigenvec¬ 
tors  with  eigenvalue  }^  =  iap  then  x  =  Ax  has  solutions  ip(t)  of  the  form 

( t )  =  ce Wjt  [  v  + 1  ( A  —  iojl)y  ] 

where  (A  —  iojl)\^0.  If  <^  =  0,  then  ^(/)  =  c(v  +  /Av)  is  real-valued.  More¬ 
over,  ||^(0ll  is  unbounded  as  t  approaches  infinity  for  any  choice  of  c=£ 0. 
Similarly,  both  the  real  and  imaginary  parts  of  \p(t)  are  unbounded  in  mag¬ 
nitude  for  arbitrarily  small  ^(0)^0,  if  Oj^=  0.  Therefore,  the  equilibrium 
solution  x(/)=0  is  unstable.  □ 

If  all  the  eigenvalues  of  A  have  negative  real  part,  then  every  solution 
x(t)  of  x  =  Ax  approaches  zero  as  t  approaches  infinity.  This  follows  im¬ 
mediately  from  the  estimate  ||x(/)||  <  ATe-a'||x(0)||  which  we  derived  in  the 
proof  of  part  (a)  of  Theorem  1.  Thus,  not  only  is  the  equilibrium  solution 
x(/)=0  stable,  but  every  solution  $(t)  of  (2)  approaches  it  as  t  approaches 
infinity.  This  very  strong  type  of  stability  is  known  as  asymptotic  stability. 

Definition.  A  solution  x  =  ^>(/)  of  (1)  is  asymptotically  stable  if  it  is  stable, 
and  if  every  solution  ${t)  which  starts  sufficiently  close  to  <£(/)  must  ap¬ 
proach  <£(/)  as  t  approaches  infinity.  In  particular,  an  equilibrium  solu¬ 
tion  x(/)  =  x°  of  (1)  is  asymptotically  stable  if  every  solution  x  =  \//(t)  of 
(1)  which  starts  sufficiently  close  to  x°  at  time  /  =  0  not  only  remains 
close  to  x°  for  all  future  time,  but  ultimately  approaches  x°  as  t  ap¬ 
proaches  infinity. 


Remark.  The  asymptotic  stability  of  any  solution  x  =  <£(/)  of  (2)  is  clearly 
equivalent  to  the  asymptotic  stability  of  the  equilibrium  solution  x(/)=0. 


Example  1.  Determine  whether  each  solution  x(/)  of  the  differential  equa¬ 
tion 


x  = 


X 


is  stable,  asymptotically  stable,  or  unstable. 

Solution.  The  characteristic  polynomial  of  the  matrix 
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is 


p  (X)  =  (det  A  -  XI)  =  det 


-1-X 

-2 


0 

1-X 


l  -3  -2  —  1  —X  J 

=  — (1+X)3  — 4(1+X)=— (l+X)(X2  +  2X+5). 


Hence,  A  =  -  1  and  A  =  - 1  ±2 1  are  the  eigenvalues  of  A.  Since  all  three  ei¬ 
genvalues  have  negative  real  part,  we  conclude  that  every  solution  of  the 
differential  equation  x  =  Ax  is  asymptotically  stable. 


Example  2.  Prove  that  every  solution  of  the  differential  equation 

hi  \y 

is  unstable. 

Solution .  The  characteristic  polynomial  of  the  matrix 

HI  \) 

p  (X)  =  det(A  -  XI)  =  det^  1  ~  ^  £  Wi-X)2-^ 


IS 


Hence  A  =  6  and  A  =  —  4  are  the  eigenvalues  of  A.  Since  one  eigenvalue  of 
A  is  positive,  we  conclude  that  every  solution  x  =  <f>(t)  of  x  =  Ax  is  unstable. 


Example  3.  Show  that  every  solution  of  the  differential  equation 


is  stable,  but  not  asymptotically  stable. 

Solution.  The  characteristic  polynomial  of  the  matrix 


is 


) 


p  (A)  =  det(A  -  AI)  =  det|  ^ 


Thus,  the  eigenvalues  of  A  are  A  =  ±  V6  i.  Therefore,  by  part  (c)  of  Theo¬ 
rem  1,  every  solution  x  =  <j>(t)  of  x  =  Ax  is  stable.  However,  no  solution  is 
asymptotically  stable.  This  follows  immediately  from  the  fact  that  the  gen¬ 
eral  solution  of  x  =  Ax  is 


V6  sin  V6  t  |  +  c  /  V6  cos  V6  t 


2cosV6  t)  2\  2sinV6/ 
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Hence,  every  solution  x(r)  is  periodic,  with  period  2w /  V6  ,  and  no  solu¬ 
tion  x(f)  (except  x(t)=0)  approaches  0  as  t  approaches  infinity. 


Example  4.  Show  that  every  solution  of  the  differential  equation 

'2-3  O' 
x=  0  -6  -2  x 

[-6  0  -3. 

is  unstable. 

Solution.  The  characteristic  polynomial  of  the  matrix 


'2-3  O' 

A=  0  -6  -2 

-6  0  -3, 

is 

2— A  -3  0  1 

p  (A)  =  det(A — AI)  =  det  0  -6-A  -2  =-A2(A  +  7). 

[  -6  0  —3— A. 

Hence,  the  eigenvalues  of  A  are  A  =>  —  7  and  A  =  0.  Every  eigenvector  v  of  A 
with  eigenvalue  0  must  satisfy  the  equation 


2-3  0  ®i  0 

Av=  0  -6  -2  =  0  • 

.-6  0  -3 J  [«3J  (.0. 

This  implies  that  t>,  =  3c2/2  and  v3=  —3v2,  so  that  every  eigenvector  v  of 
A  with  eigenvalue  0  must  be  of  the  form 

'  3' 

v  =  c  2  ■ 

.-6. 

Consequently,  every  solution  x=^(/)  of  x= Ax  is  unstable,  since  A=0  is  an 
eigenvalue  of  multiplicity  two  and  A  has  only  one  linearly  independent  ei¬ 
genvector  with  eigenvalue  0. 


Exercises 

Determine  the  stability  or  instability  of  all  solutions  of  the  following  sys¬ 
tems  of  differential  equations. 


( —7  1  — 6\  3  2  4\ 

5.  x=  10  -4  12  x  6.  x=  2  0  2  x 

\  2  -1  1/  \4  2  3/ 
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1  M 

2  3  U 

-1  -21 


r  o 

2 

0 

0] 

r  o 

2 

1 

°] 

9.  x= 

-2 

0 

0 

0 

0 

0 

0 

2 

x  10.  x  = 

-2 

0 

0 

0 

0 

0 

1 

2 

0 

0 

-2 

oj 

0 

0 

-2 

0 

j 

11.  Determine  whether  the  solutions  x{t)  =  0  and  x{t)  =  1  of  the  single  scalar  equa¬ 
tion  x  =  x(l  -  x)  are  stable  or  unstable. 

12.  Determine  whether  the  solutions  x{t)  =  0  and  x{t)=  1  of  the  single  scalar  equa¬ 
tion  x  =  —  x(\  —  x)  are  stable  or  unstable. 

13.  Consider  the  differential  equation  x  =  x2.  Show  that  all  solutions  x(t)  with  x(0) 
>  0  are  unstable  while  all  solutions  x(0  with  x(0)  <0  are  asymptotically  stable. 

14.  Consider  the  system  of  differential  equations 

xx  =  x2 

x2  =  +  +  4x2)1/2jx!.  (*) 

(a)  Show  that 


csin(ct  +  d) 
c2cos(ct  +  d) 


is  a  solution  of  (*)  for  any  choice  of  constants  c  and  d. 

(b)  Assume  that  a  solution  x(0  of  (*)  is  uniquely  determined  once  x^O)  and 
x2(0)  are  prescribed.  Prove  that  (a)  represents  the  general  solution  of  (*). 

(c)  Show  that  the  solution  x  =  0  of  (*)  is  stable,  but  not  asymptotically  stable. 

(d)  Show  that  every  solution  x(0^0  of  (*)  is  unstable. 

15.  Show  that  the  stability  of  any  solution  x(0  of  the  nonhomogeneous  equation 
x  =  Ax  +  f(0  is  equivalent  to  the  stability  of  the  equilibrium  solution  x  =  0  of 
the  homogeneous  equation  x  =  Ax. 

16.  Determine  the  stability  or  instability  of  all  solutions  x(/)  of  the  differential 
equation 


17.  (a)  Let  f(t)—  tae~b\  for  some  positive  constants  a  and  b ,  and  let  c  be  a  posi¬ 
tive  number  smaller  than  b.  Show  that  we  can  find  a  positive  constant  K 
such  that  |/(0|  <*<?-*,  0</<oo.  Hint:  Show  that  f(t)/e~ct  approaches 
zero  as  /  approaches  infinity. 

(b)  Suppose  that  all  the  eigenvalues  of  A  have  negative  real  part.  Show  that  we 
can  find  positive  constants  K  and  a  such  that  |(£A%|  <Ke~at  for  1  <  /, 
j<  n.  Hint :  Each  component  of  eAt  is  a  finite  linear  combination  of  func¬ 
tions  of  the  form  q(t)eXt ,  where  q{t)  is  a  polynomial  in  /  (of  degree  <  n  —  1) 
and  X  is  an  eigenvalue  of  A. 
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18.  (a)  Let  x{t)  —  eiatyy  o  real,  be  a  complex-valued  solution  of  x  =  Ax.  Show  that 

both  the  real  and  imaginary  parts  of  x(7)  are  bounded  solutions  of  x  =  Ax. 
(b)  Suppose  that  all  the  eigenvalues  of  A  have  real  part  <0  and  = 

=  hi  have  zero  real  part.  Let  A^  =  ioj  have  multiplicity  kjy  and  suppose  that 
A  has  kj  linearly  independent  eigenvectors  for  each  eigenvalue  \Jyj  =  1, . . . ,/. 
Prove  that  we  can  find  a  constant  K  such  that  |(eA%|  <  K. 

19.  Let 


*il 


and  define  || jc|| !  =  | jcj | -I- ...  +|x/I|.  Show  that 

(i)  ||x|| }  >  0  and  ||x||  t  =0  only  if  x  =  0 

(ii)  || Ax||  ,  =  1X111x11, 

(iii)  ||x+y||,  <  ||x||,  + ||y||, 

20.  Let 


x  = 


and  define  ||x||2  =  [|a:1|2+  ...  +|x„|2]1/2.  Show  that 

(i)  ||x||2>0  and  ||x||2  =  0  only  if  x  =  0 

(ii)  ||Xx||2  =  pq||x||2 

(iii)  ||x  +  y||2<||x||2  +  ||y||2 

21.  Show  that  there  exist  constants  M  and  N  such  that 

A^||x||i  <  ||x||2  <  jV|lxlli> 


4.3  Stability  of  equilibrium  solutions 


In  Section  4.2  we  treated  the  simple  equation  x  =  Ax.  The  next  simplest 
equation  is 


where 


x  =  AxH-g(x) 


£i(x) 


is  very  small  compared  to  x.  Specifically  we  assume  that 


g)(x)  g„(x) 

max{|x,|,...,|x„|}  max{|x1|,...,|xn|} 
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are  continuous  functions  of  xl9...,xn  which  vanish  for  xx  =  ...  =  xn  =  0. 
This  is  always  the  case  if  each  component  of  g(x)  is  a  polynomial  in 
xx,...,xn  which  begins  with  terms  of  order  2  or  higher.  For  example,  if 


*1*2 


then  both  xjxf/maxdxjl,  |x2|)  and  x,x2/max{|x,|,  |x2|}  are  continuous 
functions  of  xlfx2  which  vanish  for  xl  =  x2  =  0. 

If  g(0)=0  then  x(7)=0  is  an  equilibrium  solution  of  (1).  We  would  like 
to  determine  whether  it  is  stable  or  unstable.  At  first  glance  this  would 
seem  impossible  to  do,  since  we  cannot  solve  Equation  (1)  explicitly.  How¬ 
ever,  if  x  is  very  small,  then  g(x)  is  very  small  compared  to  Ax.  Therefore, 
it  seems  plausible  that  the  stability  of  the  equilibrium  solution  x(/)=0  of 
(1)  should  be  determined  by  the  stability  of  the  “approximate”  equation 
x  =  Ax.  This  is  almost  the  case  as  the  following  theorem  indicates. 


Theorem  2.  Suppose  that  the  vector-valued  function 

g(*)/IMI  =g(x)/max{|*il>  •  •  • ,  |*„|} 

is  a  continuous  function  of  xx,...,xn  which  vanishes  for  x  =  0.  Then , 

(a)  The  equilibrium  solution  x(/)  =  0  of  (1)  is  asymptotically  stable  if  the 
equilibrium  solution  x(/)=0  of  the  “ linearized ”  equation  x  =  Ax  is  asymp¬ 
totically  stable.  Equivalently ,  the  solution  x(/)=0  of  (1)  is  asymptotically 
stable  if  all  the  eigenvalues  of  A  have  negative  real  part. 

(b)  The  equilibrium  solution  x(/)=0  of  (1)  is  unstable  if  at  least  one  eigen¬ 
value  of  A  has  positive  real  part. 

(c)  The  stability  of  the  equilibrium  solution  x(/)=0  of  (1)  cannot  be  de¬ 
termined  from  the  stability  of  the  equilibrium  solution  x(/)=0  of  x  —  Ax  if 
all  the  eigenvalues  of  A  have  real  part  <  0  but  at  least  one  eigenvalue  of  A 
has  zero  real  part. 

Proof,  (a)  The  key  step  in  many  stability  proofs  is  to  use  the  variation  of 
parameters  formula  of  Section  3.12.  This  formula  implies  that  any  solution 
x(0  of  (1)  can  be  written  in  the  form 

x(  t)  =  eA'x(0)  +  f  eA(t  ~  S*g(x(s))  ds.  (2) 

We  wish  to  show  that  ||x(/)||  approaches  zero  as  t  approaches  infinity.  To 
this  end  recall  that  if  all  the  eigenvalues  of  A  have  negative  real  part,  then 
we  can  find  positive  constants  K  and  a  such  that  (see  Exercise  17,  Section 
4.2). 

||eA/x(0)||  <  A>-a/||x(0)|| 
and 

||<?A('-f)g(x(j))||  <  ^-“('-■')||g(x(i))||. 
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Moreover,  we  can  find  a  positive  constant  a  such  that 

||g(x)||<^||x||  if  ||x||  <  a. 

This  follows  immediately  from  our  assumption  that  g(x)/||x||  is  continuous 
and  vanishes  at  x  =  0.  Consequently,  Equation  (2)  implies  that 

||x(0||  <  ||eA'x(0)||  +  /’VA('-)g(xW)ll^ 

Jo 

<  J^-“/||x(0)||  +  £  f 'e-°('-1>||x(j)||d!y 

L  Jq 

as  long  as  ||x(^)||  <  a,  0  <  s  <  t.  Multiplying  both  sides  of  this  inequality  by 
eat  gives 

<?“'||x(/)||  <  /i:||x(0)||  +  ^  £eas\\x(s)\\ds.  (3) 

The  inequality  (3)  can  be  simplified  by  setting  z(/)  =  ea'||x(/)||,  for  then 

2(/)<  JT||x(0)||  +  yJ^z(s)<fc.  (4) 

We  would  like  to  differentiate  both  sides  of  (4)  with  respect  to  t.  However, 
we  cannot,  in  general,  differentiate  both  sides  of  an  inequality  and  still  pre¬ 
serve  the  sense  of  the  inequality.  We  circumvent  this  difficulty  by  the 
clever  trick  of  setting 

U(t)-^r  f  z(s)ds. 

L  Jq 

Then 

dU{t)  a  a  a 

or 

dU  (/)  n  m  If 

^p-ft/(0<^||x(0)||. 

Multiplying  both  sides  of  this  inequality  by  the  integrating  factor  e~at/2 
gives 

j-e~a,/2U<^Y  l|x(0)||e-“'/2, 
or 

|-e-“'/2[t/(0  +  tf||x(0)||]<0. 

Consequently, 

e-“'/2[C/(0  +  ^||x(0)||]<t/(0)  +  /:||x(0)||  =  ^||x(0)||, 
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so  that  1/(0  <  -#||x(0)||  +  #||x(0)||eat/2.  Returning  to  the  inequality  (4), 
we  see  that 


llx(0ll  —  e~a,z{t)  <  e-“'[  A'||x(0)||  +  £/(*)] 

<  A'||x(0)||e“a'/2  (5) 


as  long  as  ||x(0||  <  a,  0  <  s  <  /.  Now,  if  ||x(0)||  <  a/ K,  then  the  inequality 
(5)  guarantees  that  ||x(/)||  <  a  for  all  future  time  t.  Consequently,  the  in¬ 
equality  (5)  is  true  for  all  t>  0  if  ||x(0)||  <o/K.  Finally,  observe  from  (5) 
that  ||x(0||  <  AT||x(0)||  and  ||x(/)||  approaches  zero  as  t  approaches  infinity. 
Therefore,  the  equilibrium  solution  x(/)=0  of  (1)  is  asymptotically  stable. 

(b)  The  proof  of  (b)  is  too  difficult  to  present  here. 

(c)  We  will  present  two  differential  equations  of  the  form  (1)  where  the 
nonlinear  term  g(x)  determines  the  stability  of  the  equilibrium  solution  x(/) 
=0.  Consider  first  the  system  of  differential  equations 


dxx 

dt 


The  linearized  equation  is 


d_(x')  =  (  0  lV-M 

dt\x  2)  \-l  o/w’ 


and  the  eigenvalues  of  the  matrix 


( 


i 

0 


) 


are  ±i.  To  analyze  the  behavior  of  the  nonlinear  system  (6)  we  multiply 
the  first  equation  by  xv  the  second  equation  by  x2  and  add;  this  gives 


dxl 


dx 


xx-^- +  x2— =  -  x](x\+  x\)  -x\(x\  +  xj) 


dt 


—  —  x|)2. 


But 


Hence, 


dX\  dx 2  1  d  /  2  ,  2\ 

x'nr*^-sr2dSx'*x^ 

(*i + *!)  ■  -  + xlf- 


This  implies  that 

where 


C  =  X2(0)  +  X2(0). 
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Thus,  x\(t)  +  x2(i)  approaches  zero  as  t  approaches  infinity  for  any  solu¬ 
tion  xx(t),  x2(t)  of  (6).  Moreover,  the  value  of  x\  +  x\  at  any  time  /  is  al¬ 
ways  less  than  its  value  at  /= 0.  We  conclude,  therefore,  that  x2(t) 

=0  is  asymptotically  stable. 

On  the  other  hand,  consider  the  system  of  equations 


dxx 

dt 


+  xx 


(x^  +  xf). 


dx2 

dt 


Here  too,  the  linearized  system  is 


In  this  case,  though,  (d/dt)(xx  +  xj)  =  2(xx  +xj)2.  This  implies  that 

c  _  v-2  /n\  _i_  v2 


1  —  2cf  ’ 


c  =  xf  (0)  +  x2(0). 


Notice  that  every  solution  xx(t)9  x2(t)  of  (7)  with  x?(0)  +  x2(0)^0  ap¬ 
proaches  infinity  in  finite  time.  We  conclude,  therefore,  that  the  equi¬ 
librium  solution  x ,  ( t) = 0,  x2{ t) = 0  is  unstable.  □ 


Example  1.  Consider  the  system  of  differential  equations 


dxx 

dt 

dx2 

dt 

dx  3 

dt 


=  —  2xx  +  x2  +  3x3  +  9x\ 


=  —  —  5x^  +  1  x\ 


=  —  x3  +  xf  +  x2. 


Determine,  if  possible,  whether  the  equilibrium  solution  x ,(/)=(),  x2(t)= 0, 
x3(/)=0  is  stable  or  unstable. 

Solution .  We  rewrite  this  system  in  the  form  x  =  Ax  +  g(x)  where 


'*r 

'-2 

1 

3' 

'  9x32  ' 

X2 

,  A  = 

0 

-6 

-5 

and  g(x)  = 

lx\ 

*3 

V.  J 

0 

0 

-1 

J 

x\+x\ 

The  function  g(x)  satisfies  the  hypotheses  of  Theorem  2,  and  the  eigenval¬ 
ues  of  A  are  —2,  —6  and  —  1.  Hence,  the  equilibrium  solution  x(7)=0  is 
asymptotically  stable. 


Theorem  2  can  also  be  used  to  determine  the  stability  of  equilibrium 
solutions  of  arbitrary  autonomous  differential  equations.  Let  x°  be  an 
equilibrium  value  of  the  differential  equation 

x=f(x)  (8) 
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and  set  z(/)  =  x(Q  — x°.  Then 

z  =  x  =  f(x°  +  z).  (9) 

Clearly,  z(7)=0  is  an  equilibrium  solution  of  (9)  and  the  stability  of  x(Q  = 
x°  is  equivalent  to  the  stability  of  z(/)=0. 

Next,  we  show  that  f(x°  +  z)  can  be  written  in  the  form  f(x°+z)  =  Az  + 
g(z)  where  g(z)  is  small  compared  to  z. 


Lemma  1.  Let  f(x)  have  two  continuous  partial  derivatives  with  respect  to 
each  of  its  variables  xx, Then ,  f(x°  +  z)  can  be  written  in  the  form 

f(x°  +  z)  =  f(x°)  4-  Az  -I-  g(z)  (10) 

where  g(z)/max(|z1|, . . . , \zn\)  is  a  continuous  function  of  z  which  vanishes 
for  z  =  0. 

Proof  #1.  Equation  (10)  is  an  immediate  consequence  of  Taylor’s  Theo¬ 
rem  which  states  that  each  component  j£(x°  +  z)  of  f(x°  +  z)  can  be  written 
in  the  form 


fj(\°  +  z)=fj(\°)  + 


%  (x°) 


zx+ ...  4- 


a  Zn+gjW 


where  g/(z)/max{|z1|,...,|z/I|}  is  a  continuous  function  of  z  which  vanishes 
for  z  =  0.  Hence, 


f  (x°  4-  z)  =  f(x°)  4  Az  +  g(z) 


where 


3/,  (x°) 

3/,  (x°) 

A  •  •  • 

dx  | 

• 

• 

9  x„ 

• 

• 

• 

3/n  (X°) 

• 

3/n  (X°) 

A  •  •  • 

dx  j 

3x„ 

Proof  #2.  If  each  component  of  f(x)  is  a  polynomial  (possibly  infinite)  in 
then  each  component  of  f(x°  +  z)  is  a  polynomial  in  zu...,zn. 

Thus, 

fj(x°+z)  =  aj0+ajlzl  + ...  +aJnz„  +  gj(z)  (11) 

where  gj{ z)  is  a  polynomial  in  z zn  beginning  with  terms  of  order  two. 
Setting  z=0  in  (11)  gives  fj(x°)  —  aj0.  Hence, 


f  (x°  4-z)  =  f  (x°)  4-  Az  4-  g(z),  A  = 


j 
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and  each  component  of  g(z)  is  a  polynomial  in  zu...,zn  beginning  with 
terms  of  order  two.  □ 


Theorem  2  and  Lemma  1  provide  us  with  the  following  algorithm  for 
determining  whether  an  equilibrium  solution  x(/)=x°  of  x  =  f(x)  is  stable 
or  unstable: 

1.  Setz=x  — x°. 

2.  Write  f(x°+z)  in  the  form  Az  +  g(z)  where  g(z)  is  a  vector- valued  poly¬ 
nomial  in  zx,...,zn  beginning  with  terms  of  order  two  or  more. 

3.  Compute  the  eigenvalues  of  A.  If  all  the  eigenvalues  of  A  have  negative 
real  part,  then  x(/)=x°  is  asymptotically  stable.  If  one  eigenvalue  of  A 
has  positive  real  part,  then  x(/)=x°  is  unstable. 


Example  2.  Find  all  equilibrium  solutions  of  the  system  of  differential 
equations 


dx 

dt 


=  1  -xy. 


dy  3 

Tt=x~y 


and  determine  (if  possible)  whether  they  are  stable  or  unstable. 

Solution .  The  equations  1  —  xy  =  0  and  x  —  y 3  =  0  imply  that  x  =  1 ,  y  =  1  or 
x=  —  y  =  —  l.  Hence,  x(t)=  1,  y(t)=  1,  and  x(t)=  —  1, y(t)  =  —  1  are  the 
only  equilibrium  solutions  of  (12). 

(i)  x(t)  =  1,  >>(/)=  1:  Set  u  =  x  —  1,  v  =y  —  1.  Then, 

%  =  %  =  l-{l  +  u)(l  +  v)=-u-v-uv 

“  Yt  =  ( 1  +  “)  "  ( 1 +  c)3  =  u  -  3  °  -  3  v2  ~  V3- 

We  rewrite  this  system  in  the  form 


d_ 

dt 


uv 

3v2  +  v3 


)• 


The  matrix 

("!  :J) 

has  a  single  eigenvalue  A  =  —  2  since 


det(  \  X  _3M  =  (l+A)(3  +  A)+l  =  (A  +  2)2 

Hence,  the  equilibrium  solution  x(t)=  1,  y(t)=  1  of  (12)  is  asymptotically 
stable. 

(ii)  x(t)=  —  1  ,y(t)=  —  1:  Set  w  =  jc  +  1,  u  =  y  +  1.  Then, 

^=^=(u-l)-(v~l)3=u-:iv+3v2-v3- 
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We  rewrite  this  system  in  the  form 

K"H! 

The  eigenvalues  of  the  matrix 

(!  -1) 

are  A,  =  -  1  -  V5  ,  which  is  negative,  and  X2  =  —  1  +  V5  ,  which  is  positive. 
Therefore,  the  equilibrium  solution  x(7)=  —  1,  >>(0=  “  1  of  (12)  is  unsta¬ 
ble. 

Example  3.  Find  all  equilibrium  solutions  of  the  system  of  differential 
equations 


(13) 


and  determine  whether  they  are  stable  or  unstable. 

Solution.  The  equilibrium  points  of  (13)  are  determined  by  the  two  equa¬ 
tions  sin(x+>')  =  0  and  ex—  1  =0.  The  second  equation  implies  that  x  =  0, 
while  the  first  equation  implies  that  x  +y  =  wr,  n  an  integer.  Consequently, 
x(/)=0, y(t)=m,  n  =  0,  ±  1,  ±2,...,  are  the  equilibrium  solutions  of  (13). 
Setting  u  =  x,  v  =y  —  m ,  gives 


^  =  sin(w  +  v  4-  m ), 


—  =  eu _  \ 
dt 


Now,  sin(w  +  u  +  mr)  =  cos  mr  sin(w  +  v)  =  ( —  1)”  sin(«  + 1>).  Therefore, 


^  l)"sin(«  +  o). 


dt 


Next,  observe  that 


(w  +  c>)3 

sin(w  + t>)  =  w  +  o - — - 1- .. 


,u 


Hence, 


du _ /  |  \ 


n 


3! 


(«  +  vf 

(u  +  v)-K—^-  + 


.  u 

1  =  M+  —  +  . . .  . 


dv  .  u2  . 
dt  «+2!+.... 


We  rewrite  this  system  in  the  form 


<L(»)J(- 1)”  (-1)" 

dt\v>  l  !  o 


The  eigenvalues  of  the  matrix 


(-i) 

i 


n 


“ )  +  terms  of  order  2  or 
higher  in  u  and  v. 


(-1) 

0 


n 
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are 


,  (-ir-Vi+4(-i)n  A  (-ir+Vi+4(-i)" 

Ai“  2  ’  A2“  2 

When  n  is  even,  Xx  =(1  —  V5  )/2  is  negative  and  X2  =  ( 1  +  V5  )/2  is  posi¬ 
tive.  Hence,  x{t)=0,y{t)=mr  is  unstable  if  n  is  even.  When  n  is  odd,  both 
Xj  =  ( —  1  —  V3  i) /2  and  A2  =  (  —  1  +  V3  /) / 2  have  negative  real  part.  There¬ 
fore,  the  equilibrium  solution  x(f)=0,  y(t)  =  m  is  asymptotically  stable  if 
n  is  odd. 


Exercises 


Find  all  equilibrium  solutions  of  each  of  the  following  systems  of  equa¬ 
tions  and  determine,  if  possible,  whether  they  are  stable  or  unstable. 


1.  x  =  x  —  x3  —  xy2 
y  =  2y-y5-yx4 

4.  x  =  6x  —  6x2  —  2xy 
y =4y  —  4y2  — 2.xy 


2.  x  =  x2+y2-\ 

•  2  2 
y  =  xz-yz 

5.  x  =  tan (x+y) 
y  =  x  +  x3 


3.  x  =  x2+y2-\ 
y  —  2xy 

6.  x  =  ey  —  x 
y  =  ex+y 


Verify  that  the  origin  is  an  equilibrium  point  of  each  of  the  following  sys¬ 
tems  of  equations  and  determine,  if  possible,  whether  it  is  stable  or  unsta¬ 
ble. 


7.  x 

y 

10.  x 

y 

12.  x 

y 

14.  x 

y 

z 

16.  x 

y 


=y  +  3x2 
=  x  —  3y2 

=  ln(l  +  x+y2) 

=  — y +  x3 

=  8x  —  3y  +  ey  —  1 
=  sin  x2  —  ln(l  —  x  —y) 

=  x—y  +  z2 
=y  +  z  —  x2 
=  z  —  x+y2 

=  ln(l  -z) 

=  ln(l  —  x) 

ln(l~y) 


8.  x  =y  +  cosy  —  1 
y  =  —  sinx  +  x3 


9.  x  =  ex+y-l 
y  =  sin(x  +y) 


11.  x  =  cosy  —  sinx— 1 
y=x-y-y 2 

13.  x=  —  x—  y  —  (x2+y2)2/2 
y  =  x-y+(x2+y2)2/2 

15.  x  =  ex+y+z-l 
y  =  sin(x  +y  +  z) 
z-x-y-z2 

17.  x  =  x  —  cosy  —  z  + 1 
y  —y  —  cosz  —  x  +  1 
i  =  z  —  cosx— y  -I- 1 


z  = 

18  (a)  Find  all  equilibrium  solutions  of  the  system  of  differential  equations 


dx  , 

—  =gz  —  hx. 


dy_ 

dt 


a  +  bx 


- ky , 


dz 

dt 


=  ey  -fz. 


(This  system  is  a  model  for  the  control  of  protein  synthesis.) 

(b)  Determine  the  stability  or  instability  of  these  solutions  if  either  g,  e,  or  c  is 
zero. 
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4.4  The  phase-plane 

In  this  section  we  begin  our  study  of  the  “geometric”  theory  of  differentia! 
equations.  For  simplicity,  we  will  restrict  ourselves,  for  the  most  part,  tc 
the  case  n  =  2.  Our  aim  is  to  obtain  as  complete  a  description  as  possible  ol 
all  solutions  of  the  system  of  differential  equations 

f=/(^),  t=8{x’y)-  (1) 

To  this  end,  observe  that  every  solution  x  =  x(t)9  y=y(t)  of  (1)  defines  a 
curve  in  the  three-dimensional  space  t,x9y.  That  is  to  say,  the  set  of  all 
points  (/,*(/), >>(/))  describe  a  curve  in  the  three-dimensional  space  t,x,y. 
For  example,  the  solution  x  =  cos/,  >>  =  sin/  of  the  system  of  differential 
equations 

dx  dy 


describes  a  helix  (see  Figure  1)  in  (t9x,y)  space. 

The  geometric  theory  of  differential  equations  begins  with  the  important 
observation  that  every  solution  x  =  x(t),  y  =y(t),  t0<  t  <  tv  of  (1)  also  de¬ 
fines  a  curve  in  the  x—y  plane.  To  wit,  as  /  runs  from  /0  to  tl9  the  set  of 
points  (x(t),y(tj)  trace  out  a  curve  C  in  the  x—y  plane.  This  curve  is 
called  the  orbit ,  or  trajectory ,  of  the  solution  x  =  x(t),y=y(t)9  and  the  x  —y 
plane  is  called  the  phase-plane  of  the  solutions  of  (1).  Equivalently,  we  can 
think  of  the  orbit  of  x(t),y(t)  as  the  path  that  the  solution  traverses  in  the 
x—y  plane. 


Example  1.  It  is  easily  verified  that  x  =  cos/,  y  =  sin/  is  a  solution  of  the 
system  of  differential  equations  x  =  —y,  y  =  x.  As  t  runs  from  0  to  2m,  the 
set  of  points  (cos f, sin t)  trace  out  the  unit  circle  x2+>>2=1  in  the  x—y 
plane.  Hence,  the  unit  circle  x2+>>2=1  is  the  orbit  of  the  solution 
x  =  cost,  y  =  sin/,  0</<27t.  As  t  runs  from  0  to  oo,  the  set  of  points 
(cos/, sin/)  trace  out  this  circle  infinitely  often. 
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Example  2.  It  is  easily  verified  that  x  =  e~t  cos  /,  y  =  e  ~ '  sin  t,  —  oo  <  /  <  oo, 
is  a  solution  of  the  system  of  differential  equations  dx/dt=  —  x—y,  dy  / dt 
=  x—y.  As  t  runs  from  —  oo  to  oo,  the  set  of  points  (e-'cos/,e“'sin/) 
trace  out  a  spiral  in  the  x—y  plane.  Hence,  the  orbit  of  the  solution 
x  =  e~tcost, y  =  e~‘sint  is  the  spiral  shown  in  Figure  2. 


X 


Figure  2.  Orbit  of  x  =  e  '  cos t,y  =  e  1  sin t 

Example  3.  It  is  easily  verified  that  x  =  3f  +  2, y  =  5t  +  l,  — oo<t<ooisa 
solution  of  the  system  of  differential  equations  dx/dt  =  3,  dy /dt  =  5.  As  t 
runs  from  —  oo  to  oo,  the  set  of  points  (3/ 4- 2, 5/ +  7)  trace  out  the  straight 
line  through  the  point  (2,7)  with  slope  f .  Hence,  the  orbit  of  the  solution 

*  =  3/ +  2, y  =  5/ +  7  is  the  straight  line^  =  |(x  — 2)  +  7,  —  oo<x<oo. 


Example  4.  It  is  easily  verified  that  x  =  3/2  +  2,  >>  =  5/2  +  7,  0<t<oo  is  a 
solution  of  the  system  of  differential  equations 

$  =6[Cv  —  7)/5]I/2,  ^  =  10[(*-2)/3]'/2. 

All  of  the  points  (3/2  +  2,5/2  +  7)  lie  on  the  line  through  (2,7)  with  slope  f. 
However,  x  is  always  greater  than  or  equal  to  2,  and  y  is  always  greater 
than  or  equal  to  7.  Hence,  the  orbit  of  the  solution  x  =  3/2  +  2,  y  =  5/2  +  7, 
0<  t<  oo,  is  the  straight  line>>  =  f(x-2)  +  7,  2<x<oo. 


Example  5.  It  is  easily  verified  that  x  =  3/  +  2,>>  =  5/2  +  7,  -  oo  <  /  <  oo,  is  a 
solution  of  the  system  of  differential  equations 


dx 

dt 


-y-  f(*-2)2— 4, 


dy_ 

dt 


10 

3 


(x-2). 
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The  orbit  of  this  solution  is  the  set  of  all  points  =  (3/  +  2, 5/2  H-  7). 
Solving  for  t  =  |(x  —  2),  we  see  that  y  =  f  (x  —  2)2  4-  7.  Hence,  the  orbit  of  the 

solution  x  =  3/  +  2, y  =  5/2  +  7  is  the  parabola^  =  f(x  — 2)2  +  7,  |x|<oo. 

One  of  the  advantages  of  considering  the  orbit  of  the  solution  rather 
than  the  solution  itself  is  that  is  is  often  possible  to  obtain  the  orbit  of  a 
solution  without  prior  knowledge  of  the  solution.  Let  x  =  x(t),y  =y{t)  be  a 
solution  of  (1).  If  x'(t)  is  unequal  to  zero  at  t  =  tx,  then  we  can  solve  for 
t  —  t(x)  in  a  neighborhood  of  the  point  x,  =  x{tx)  (see  Exercise  4).  Thus,  for 
/  near  tx .  the  orbit  of  the  solution  x(t),y(t)  is  the  curves  —y{t{x)).  Next, 
observe  that 


dy  _dy  dt  _  dy/dt  _  g(x,y ) 
dx  dt  dx  dx/dt  f(x,y) 

Thus,  the  orbits  of  the  solutions  x  =  x(/),  y  =y(t)  of  (1)  are  the  solution 
curves  of  the  first-order  scalar  equation 

dy  _  g(x,y) 

dx  f(x,y )  ■  {  } 

Therefore,  it  is  not  necessary  to  find  a  solution  x(t),  y(t)  of  (1)  in  order  to 
compute  its  orbit;  we  need  only  solve  the  single  first-order  scalar  differen¬ 
tial  equation  (2). 

Remark.  From  now  on,  we  will  use  the  phrase  “the  orbits  of  (1)”  to  denote 
the  totality  of  orbits  of  solutions  of  (1). 


Example  6.  The  orbits  of  the  system  of  differential  equations 


dx 

dt 


dt 


x 


2 


are  the  solution  curves  of  the  scalar  equation  dy /dx  =  x2/y2.  This  equation 
is  separable,  and  it  is  easily  seen  that  every  solution  is  of  the  form  >>(*)  = 
(x3  —  c)1/3,  c  constant.  Thus,  the  orbits  of  (3)  are  the  set  of  all  curves 
y  =  (x3  —  c)1/3. 


Example  7.  The  orbits  of  the  system  of  differential  equations 

^ =y{\  +  x2+y%  ^  =  -2x(\  +  x2+y2)  (4) 

are  the  solution  curves  of  the  scalar  equation 

dy  _  2x(l  +  x2+y2)  _  2x 
dx  y(\  +  x*+y2)  y 

This  equation  is  separable,  and  all  solutions  are  of  the  form  \y2  +  x2  =  c2. 
Hence,  the  orbits  of  (4)  are  the  families  of  ellipses  \y2  +  x2  =  c2. 
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Warning.  A  solution  curve  of  (2)  is  an  orbit  of  (1)  only  if  dx/dt  and  dy  /  dt 
are  not  zero  simultaneously  along  the  solution.  If  a  solution  curve  of  (2) 
passes  through  an  equilibrium  point  of  (1),  then  the  entire  solution  curve  is 
not  an  orbit.  Rather,  it  is  the  union  of  several  distinct  orbits.  For  example, 
consider  the  system  of  differential  equations 

^-=y{\-x2-y2),  ^  =  -x(l-x2-y2).  (5) 

The  solution  curves  of  the  scalar  equation 

dy  =  dy/dt  =  x 
dx  dx/dt  y 

are  the  family  of  concentric  circles  x2+y2  =  c2.  Observe,  however,  that  ev¬ 
ery  point  on  the  unit  circle  x2+y2=  1  is  an  equilibrium  point  of  (5).  Thus, 
the  orbits  of  this  system  are  the  circles  x2+y2  =  c2,  for  c=£  1,  and  all  points 
on  the  unit  circle  x2+y2=  1.  Similarly,  the  orbits  of  (3)  are  the  curves  y  = 
(x3  —  c)1/3,  c  t^O;  the  half-lines  y  =  x,  x>0,  and  y  =  x,  x<0;  and  the  point 
(0,0). 

It  is  not  possible,  in  general,  to  explicitly  solve  Equation  (2).  Hence,  we 
cannot,  in  general,  find  the  orbits  of  (1).  Nevertheless,  it  is  still  possible  to 
obtain  an  accurate  description  of  all  orbits  of  (1).  This  is  because  the  sys¬ 
tem  of  differential  equations  (1)  sets  up  a  direction  field  in  the  x—y  plane. 
That  is  to  say,  the  system  of  differential  equations  (1)  tells  us  how  fast  a 
solution  moves  along  its  orbit,  and  in  what  direction  it  is  moving.  More 
precisely,  let  x  =  *(/),>>  =y{t)  be  a  solution  of  (1).  As  t  increases,  the  point 
(*(/),>>(/))  moves  along  the  orbit  of  this  solution.  Its  velocity  in  the  x-di- 
rection  is  dx/dt\  its  velocity  in  the y -direction  is  dy / dt;  and  the  magnitude 
of  its  velocity  is  [(dx(t)/dt)2  +  (dy(t)/dt)2]1/2.  But  dx(t)/ dt  =f  (x(t),  y(t)), 
and  cfy(t)/dt  =  g(x(t),y(t)).  Hence,  at  each  point  (x,y)  in  the  phase  plane 
of  (1)  we  know  (i),  the  tangent  to  the  orbit  through  (x ,y)  (the  line  through 
(jt,  y)  with  direction  numbers  f(x ,  y\  g(x ,  y)  respectively)  and  (ii),  the 
speed  [  f2(x ,  y)+  g2(x ,  y)]x/2  with  which  the  solution  is  traversing  its  orbit. 
As  we  shall  see  in  Sections  4.8-13,  this  information  can  often  be  used  to 
deduce  important  properties  of  the  orbits  of  (1). 

The  notion  of  orbit  can  easily  be  extended  to  the  case  n>  2.  Let  x  =  x(0 
be  a  solution  of  the  vector  differential  equation 

Xj  /l  (X|, . . . ,  xf) 

x=f(x),  x  =  :  ,  f(x)=  :  (6) 

•  • 

>.  n  -  fn  (-*1?  •  •  •  >  -*>j) 

on  the  interval  t0  <  t  <  As  t  runs  from  t0  to  the  set  of  points 
(*i(0,  •  •  •  >xn(t))  trace  out  a  curve  C  in  the  ^-dimensional  space 
xl9x2,...9xn.  This  curve  is  called  the  orbit  of  the  solution  x  =  x(/),  for  t0<  t 
<  /„  and  the  Ai-dimensional  space  xl9...9xn  is  called  the  phase-space  of  the 
solutions  of  (6). 
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Exercises 


In  each  of  Problems  1-3,  verify  that  x(t\y(t)  is  a  solution  of  the  given  sys¬ 
tem  of  equations,  and  find  its  orbit. 

1.  x=l,  y  =  2(1  —  x)sin(l  —  xf 
x(/)=l  +  /,  y(t)  =  cost2 

2.  x  =  e~x,  y-eeX~l 

x(0*ln(l  +  0>  y( i)  =  et 

3.  x=l  +  x2,  y  —  ( l+x2)sec2A: 

x(/)  =  tan/,  y(/)  =  tan(tan/) 


4.  Suppose  that  x'(tx)^0.  Show  that  we  can  solve  the  equation  x  =  x(t)  for  t  = 
t(x)  in  a  neighborhood  of  the  point  xx  =  x(tx).  Hint:  If  x'(tx)=£ 0,  then  x{t)  is  a 
strictly  monotonic  function  of  /  in  a  neighborhood  of  /  =  tx. 


Find  the  orbits  of  each  of  the  following  systems. 


5.  x=y, 
y=-x 

7.  x=y(l  +  x+y), 

y=-X(\+X+y ) 

9.  x=xye~3x , 
y—  —2xy2 

11.  x  =  ax  —  bxy, 
y  =  cx  —  dxy 
(a,b,c,d  positive) 

13.  x  —  2  xy, 

•  2  2 
y  —  x  —y 


6.  x=y(l  +  x2+y2), 
y=  -  x(\  +  x2+y2) 

8.  x=y  +  x^y, 

y  =  3x  +  xy2 

10.  x  =  4y, 
y  =  x  +  xy2 

12.  x  =  x2  +  cosy, 
y=-2xy 

14.  x=y  +  sin x, 
y  —  x—  y  cos  a: 


4.5  Mathematical  theories  of  war 

4.5.1.  L.  F.  Richardson's  theory  of  conflict 

In  this  section  we  construct  a  mathematical  model  which  describes  the  re¬ 
lation  between  two  nations,  each  determined  to  defend  itself  against  a  pos¬ 
sible  attack  by  the  other.  Each  nation  considers  the  possibility  of  attack 
quite  real,  and  reasonably  enough,  bases  its  apprehensions  on  the  readiness 
of  the  other  to  wage  war.  Our  model  is  based  on  the  work  of  Lewis  Fry 
Richardson.  It  is  not  an  attempt  to  make  scientific  statements  about  fore¬ 
ign  politics  or  to  predict  the  date  at  which  the  next  war  will  break  out. 
This,  of  course,  is  clearly  impossible.  Rather,  it  is  a  description  of  what 
people  would  do  if  they  did  not  stop  to  think.  As  Richardson  writes:  “Why 
are  so  many  nations  reluctantly  but  steadily  increasing  their  armaments  as 
if  they  were  mechanically  compelled  to  do  so?  Because,  I  say,  they  follow 
their  traditions  which  are  fixtures  and  their  instincts  which  are  mechanical; 
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and  because  they  have  not  yet  made  a  sufficiently  strenuous  intellectual 
and  moral  effort  to  control  the  situation.  The  process  described  by  the  en¬ 
suing  equations  is  not  to  be  thought  of  as  inevitable.  It  is  what  would  occur 
if  instinct  and  tradition  were  allowed  to  act  uncontrolled.” 

Let  x  =  x{t)  denote  the  war  potential,  or  armaments,  of  the  first  nation, 
which  we  will  call  Jedesland,  and  let  y(t)  denote  the  war  potential  of  the 
second  nation,  which  we  will  call  Andersland.  The  rate  of  change  of  x(t) 
depends,  obviously,  on  the  war  readiness  y(t)  of  Andersland,  and  on  the 
grievances  that  Jedesland  feels  towards  Andersland.  In  the  most  simplistic 
model  we  represent  these  terms  by  ky  and  g  respectively,  where  k  and  g 
are  positive  constants.  These  two  terms  cause  x  to  increase.  On  the  other 
hand,  the  cost  of  armaments  has  a  restraining  effect  on  dx/dt .  We  repre¬ 
sent  this  term  by  —  ax,  where  a  is  a  positive  constant.  A  similar  analysis 
holds  for  dy /dt.  Consequently,  x  =  x(t),  y  =y(t)  is  a  solution  of  the  linear 
system  of  differential  equations 

^  =  ky-ax+g,  ^  =  lx  —  fty  +  h.  (1) 

Remark.  The  model  (1)  is  not  limited  to  two  nations;  it  can  also  represent 
the  relation  between  two  alliances.  For  example,  Andersland  and  Jedes¬ 
land  can  represent  the  alliances  of  France  with  Russia,  and  Germany  with 
Austria-Hungary  during  the  years  immediately  prior  to  World  War  I. 

Throughout  history,  there  has  been  a  constant  debate  on  the  cause  of 
war.  Over  two  thousand  years  ago,  Thucydides  claimed  that  armaments 
cause  war.  In  his  account  of  the  Peloponnesian  war  he  writes:  “The  real 
though  unavowed  cause  I  believe  to  have  been  the  growth  of  Athenian 
power,  which  terrified  the  Lacedaemonians  and  forced  them  into  war.”  Sir 
Edward  Grey,  the  British  Foreign  Secretary  during  World  War  I  agrees. 
He  writes:  “The  increase  of  armaments  that  is  intended  in  each  nation  to 
produce  consciousness  of  strength,  and  a  sense  of  security,  does  not  pro¬ 
duce  these  effects.  On  the  contrary,  it  produces  a  consciousness  of  the 
strength  of  other  nations  and  a  sense  of  fear.  The  enormous  growth  of 
armaments  in  Europe,  the  sense  of  insecurity  and  fear  caused  by  them — it 
was  these  that  made  war  inevitable.  This  is  the  real  and  final  account  of 
the  origin  of  the  Great  War.” 

On  the  other  hand,  L.  S.  Amery,  a  member  of  Britain’s  parliament 
during  the  1930’s  vehemently  disagrees.  When  the  opinion  of  Sir  Edward 
Grey  was  quoted  in  the  House  of  Commons,  Amery  replied:  “With  all  due 
respect  to  the  memory  of  an  eminent  statesman,  I  believe  that  statement  to 
be  entirely  mistaken.  The  armaments  were  only  the  symptoms  of  the  con¬ 
flict  of  ambitions  and  ideals,  of  those  nationalist  forces  which  created  the 
War.  The  War  was  brought  about  because  Serbia,  Italy  and  Rumania 
passionately  desired  the  incorporation  in  their  states  of  territories  which  at 
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that  time  belonged  to  the  Austrian  Empire  and  which  the  Austrian  govern¬ 
ment  was  not  prepared  to  abandon  without  a  struggle.  France  was  pre¬ 
pared,  if  the  opportunity  ever  came,  to  make  an  effort  to  recover  Alsace- 
Lorraine.  It  was  in  those  facts,  in  those  insoluble  conflicts  of  ambitions, 
and  not  in  the  armaments  themselves,  that  the  cause  of  the  War  lay.” 

The  system  of  equations  (1)  takes  both  conflicting  theories  into  account. 
Thucydides  and  Sir  Edward  Grey  would  take  g  and  h  small  compared  to  k 
and  /,  while  Mr.  Amery  would  take  k  and  /  small  compared  to  g  and  h . 

The  system  of  equations  (1)  has  several  important  implications.  Suppose 
that  g  and  h  are  both  zero.  Then,  x(/)=0,  j>(/)=0  is  an  equilibrium  solu¬ 
tion  of  (1).  That  is,  if  x,y,  g ,  and  h  are  all  made  zero  simultaneously,  then 
x(t)  and  y{t)  will  always  remain  zero.  This  ideal  condition  is  permanent 
peace  by  disarmament  and  satisfaction.  It  has  existed  since  1817  on  the 
border  between  Canada  and  the  United  States,  and  since  1905  on  the 
border  between  Norway  and  Sweden. 

These  equations  further  imply  that  mutual  disarmament  without  satis¬ 
faction  is  not  permanent.  Assume  that  x  and  y  vanish  simultaneously  at 
some  time  t  =  t0.  At  this  time,  dx/dt  =  g  and  dy / dt  =  h.  Thus,  x  and  y  will 
not  remain  zero  if  g  and  h  are  positive.  Instead,  both  nations  will  rearm. 

Unilateral  disarmament  corresponds  to  setting  >>  =  0  at  a  certain  instant 
of  time.  At  this  time,  dy / dt  =  lx  +  h.  This  implies  that  y  will  not  remain 
zero  if  either  h  or  x  is  positive.  Thus,  unilateral  disarmament  is  never  per¬ 
manent.  This  accords  with  the  historical  fact  that  Germany,  whose  army 
was  reduced  by  the  Treaty  of  Versailles  to  100,000  men,  a  level  far  below 
that  of  several  of  her  neighbors,  insisting  on  rearming  during  the  years 
1933-36. 

A  race  in  armaments  occurs  when  the  “defense”  terms  predominate  in 
(1).  In  this  case, 


dx 

dt 


—  ky. 


<fy_ 

dt 


—  lx. 


Every  solution  of  (2)  is  of  the  form 


x(t)  =  AeVFl,+  Be-VFn,  y(t)  =  -\fl  [A, 


>Vki  t 


—  Be 


-Vki 


']• 


Therefore,  both  x{t)  and>>(7)  approach  infinity  if  A  is  positive.  This  infin¬ 
ity  can  be  interpreted  as  war. 

Now,  the  system  of  equations  (1)  is  not  quite  correct,  since  it  does  not 
take  into  effect  the  cooperation,  or  trade,  between  Andersland  and  Jedes- 
land.  As  we  see  today,  mutual  cooperation  between  nations  tends  to  de¬ 
crease  their  fears  and  suspicions.  We  correct  our  model  by  changing  the 
meaning  of  x(t)  and  y(t);  we  let  the  variables  x{t)  and  y{t)  stand  for 
“threats”  minus  “cooperation.”  Specifically,  we  set  x=U—  U0  and  y  = 
V  —  V0 ,  where  U  is  the  defense  budget  of  Jedesland,  V  is  the  defense 
budget  of  Andersland,  U0  is  the  amount  of  goods  exported  by  Jedesland  to 
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Andersland  and  V0  is  the  amount  of  goods  exported  by  Andersland  to 
Jedesland.  Observe  that  cooperation  evokes  reciprocal  cooperation,  just  as 
armaments  provoke  more  armaments.  In  addition,  nations  have  a  tendency 
to  reduce  cooperation  on  account  of  the  expense  which  it  involves.  Thus, 
the  system  of  equations  (1)  still  describes  this  more  general  state  of  affairs. 

The  system  of  equations  (1)  has  a  single  equilibrium  solution 


kh  +  fig  Ig- 1-  ah 

x  =  x°=l^kl’  y=y°=^Ti 


if  afi  —  kh^  0.  We  are  interested  in  determining  whether  this  equilibrium 
solution  is  stable  or  unstable.  To  this  end,  we  write  (1)  in  the  form  w  =  Aw 
+  f,  where 


[  *(0  \ 
Uo/’ 

The  equilibrium  solution  is 


'■(*)•  “d  A_(  “ 


where  Aw0  +  f  =  0.  Setting  z  = 


w  —  w0,  we  obtain  that 


z  =  w  =  Aw  +  f  =  A(z  +  w0)  +  f  =  Az  +  Aw0  +  f  =  A z. 


Clearly,  the  equilibrium  solution  w(7)  =  w0  of  w  =  Aw  +  f  is  stable  if,  and 
only  if,  z  =  0  is  a  stable  solution  of  z  =  Az.  To  determine  the  stability  of  z  = 
0  we  compute 


p(X)  —  det^  ^  ^  ^  j  —  A2  +  (a  +  fi  )A  4-  afi  — ■  kl. 

The  roots  of  p( A)  are 

—  (a  +  (3)±  |^(a  +  ^S)2  —  4(«jS  —  /c/)j  ^ 


—  (a  +  fi)±  [(a  —  /?)2  +  4/:/j  ^ 

!  2  * 

Notice  that  both  roots  are  real  and  unequal  to  zero.  Moreover,  both  roots 
are  negative  if  a/3—  kl> 0,  and  one  root  is  positive  if  aft  —  kl< 0.  Thus,  z (t) 
=  0,  and  consequently  the  equilibrium  solution  x(t)  =  x0,y(t)=y0  is  stable 
if  afi  —  kl  >  0  and  unstable  if  afi  —  kl<  0. 

Let  us  now  tackle  the  difficult  problem  of  estimating  the  coefficients  a , 
fi,  k,  /,  g,  and  h.  There  is  no  way,  obviously,  of  measuring  g  and  h.  How¬ 
ever,  it  is  possible  to  obtain  reasonable  estimates  for  a,  fi,  k,  and  /.  Observe 
that  the  units  of  these  coefficients  are  reciprocal  times.  Physicists  and  en¬ 
gineers  would  call  a  ~ 1  and  fi  ~ 1  relaxation  times,  for  if  y  and  g  were  identi¬ 
cally  zero,  then  x(t)  =  e~a{t~tQ)x{t0).  This  implies  that  x(f0+a-1)  = 
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*00)/e.  Hence,  a-1  is  the  time  required  for  Jedesland’s  armaments  to  be 
reduced  in  the  ratio  2.718  if  that  nation  has  no  grievances  and  no  other  na¬ 
tion  has  any  armaments.  Richardson  estimates  a-1  to  be  the  lifetime  of 
Jedesland’s  parliament.  Thus,  a  =  0.2  for  Great  Britain,  since  the  lifetime  of 
Britain’s  parliament  is  five  years. 

To  estimate  k  and  /  we  take  a  hypothetical  case  in  which  g  =  0  and  >>  = 
yv  so  that  dx/dt  =  kyx  —  ax .  When  x  =  0,  \/k=yx/(dx/dt).  Thus,  \/k  is 
the  time  required  for  Jedesland  to  catch  up  to  Andersland  provided  that  (i) 
Andersland’s  armaments  remain  constant,  (ii)  there  are  no  grievances,  and 
(iii)  the  cost  of  armaments  doesn’t  slow  Jedesland  down.  Consider  now  the 
German  rearmament  during  1933-36.  Germany  started  with  nearly  zero 
armaments  and  caught  up  with  her  neighbors  in  about  three  years.  Assum¬ 
ing  that  the  slowing  effect  of  a  nearly  balanced  the  Germans’  very  strong 
grievances  g,  we  take  k  =  0.3  (year)-1  for  Germany.  Further,  we  observe 
that  k  is  obviously  proportional  to  the  amount  of  industry  that  a  nation 
has.  Thus,  fc  =  0.15  for  a  nation  which  has  only  half  the  industrial  capacity 
of  Germany,  and  k  =  0.9  for  a  nation  which  has  three  times  the  industrial 
capacity  of  Germany. 

Let  us  now  check  our  model  against  the  European  arms  race  of 
1909-1914.  France  was  allied  with  Russia,  and  Germany  was  allied  with 
Austria-Hungary.  Neither  Italy  or  Britain  was  in  a  definite  alliance  with 
either  party.  Thus,  let  Jedesland  represent  the  alliance  of  France  with 
Russia,  and  let  Andersland  represent  the  alliance  of  Germany  with 
Austria-Hungary.  Since  these  two  alliances  were  roughly  equal  in  size  we 
take  k  =  /,  and  since  each  alliance  was  roughly  three  times  the  size  of 
Germany,  we  take  k  =  1=0.9.  We  also  assume  that  a  =  (3=0.2.  Then, 


dx 

dt 


—  ax  +  ky  +  g, 


dt 


=  kx  —  ay  +  h. 


Equation  (4)  has  a  unique  equilibrium  point 

kh  4-  ag  kg  +  ah 

X°=  a2—k2  ’  y°=  <x2-k2  ' 


This  equilibrium  is  unstable  since 

aft  —  kl=  a2—  k2  =  0.04 -0.81  =  -  0.77. 

This,  of  course,  is  in  agreement  with  the  historical  fact  that  these  two  alli¬ 
ances  went  to  war  with  each  other. 

Now,  the  model  we  have  constructed  is  very  crude  since  it  assumes  that 
the  grievances  g  and  h  are  constant  in  time.  This  is  obviously  not  true.  The 
grievances  g  and  h  are  not  even  continuous  functions  of  time  since  they 
jump  instantaneously  by  large  amounts.  (It’s  safe  to  assume,  though,  that  g 
and  h  are  relatively  constant  over  long  periods  of  time.)  In  spite  of  this,  the 
system  of  equations  (4)  still  provides  a  very  accurate  description  of  the 
arms  race  preceding  World  War  I.  To  demonstrate  this,  we  add  the  two 
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equations  of  (4)  together,  to  obtain  that 

x+y)  =  (k-a)(x+y)  +  g+h .  (5) 

Recall  that  x=  U—  U0  and  y—V—V0,  where  U  and  V  are  the  defense 
budgets  of  the  two  alliances,  and  U0  and  V0  are  the  amount  of  goods  ex¬ 
ported  from  each  alliance  to  the  other.  Hence, 


j-W+V).(k-c) 


U+  V- 


U0+  V0- 


g  +  h 
k  —  a 


a  dt 


£(tfo+*o) 


\ 

1  . 


The  defense  budgets  for  the  two  alliances  are  set  out  in  Table  I. 


Table  1.  Defense  budgets  expressed  in  millions  of  -£  ster¬ 
ling 


1909 

1910 

1911 

1912 

1913 

France 

48.6 

50.9 

57.1 

63.2 

74.7 

Russia 

66.7 

68.5 

70.7 

81.8 

92.0 

Germany 

63.1 

62.0 

62.5 

68.2 

95.4 

Austria-Hungary 

20.8 

23.4 

24.6 

25.5 

26.9 

Total  U+  V 

199.2 

204.8 

214.9 

238.7 

289.0 

A(U+  V ) 

5.6  10.1 

23.8 

50.3 

U+V  at  same  date 

202.0  209.8 

226.8  263.8 

In  Figure  1  we  plot  the  annual  increment  of  U+V  against  the  average 
of  U+V  for  the  two  years  used  in  forming  the  increment.  Notice  how 
close  these  four  points,  denoted  by  o,  are  to  the  straight  line 

A(f/+  F)  =  0.73(£/+  V—  194).  (7) 

Thus,  foreign  politics  does  indeed  have  a  machine-like  predictability. 
Equations  (6)  and  (7)  imply  that 

g+  h  =  (k  —  a)(U0  +  F0)  — A(£/0+  VQ)—  194 

and  k  —  a  —  0.73.  This  is  in  excellent  agreement  with  Richardson’s  esti¬ 
mates  of  0.9  for  k  and  0.2  for  a.  Finally,  observe  from  (7)  that  the  total  de¬ 
fense  budgets  of  the  two  alliances  will  increase  if  C/+  V  is  greater  than  194 
million,  and  will  decrease  otherwise.  In  actual  fact,  the  defense  expendi¬ 
tures  of  the  two  alliances  was  199.2  million  in  1909  while  the  trade  between 
the  two  alliances  amounted  to  only  171.8  million.  Thus  began  an  arms  race 
which  led  eventually  to  World  War  I. 
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Reference 

Richardson,  L.  F.,  “Generalized  foreign  politics,”  The  British  Journal  of  Psychol¬ 
ogy,  monograph  supplement  #23,  1939. 


Exercises 


1.  Suppose  that  what  moves  a  government  to  arm  is  not  the  magnitude  of  other  na¬ 
tions’  armaments,  but  the  difference  between  its  own  and  theirs.  Then, 

"  k(y  “  *)  “  ax  +  S’  w  =  ~y^  ~  +  h' 

Show  that  every  solution  of  this  system  of  equations  is  stable  if  A^/,  <((*!  + 
k\)(P\  +  h)  and  unstable  if  klll>(al  +  kl)U$l  +  lx). 


2.  Consider  the  case  of  three  nations,  each  having  the  same  defense  coefficient  k 
and  the  same  restraint  coefficient  a.  Then, 


Setting 


u  = 


X 


y 

z 


dx 

dt 


—  ax  +  ky  +  kz  +  gx 


<fy_ 

dt 


=  kx  —  ay  +  kz  +  g2 


dz 

dt 


=  kx  + ky  —  az  +  g3. 


[  -a  k  k 
A=  k  -a  k 
\  k  k  —  a 


and  g  = 


8i  , 

83) 


we  see  that  n  =  Au  +  g. 

(a)  Show  that  p(X)  =  det(A  —  XI)  =  —  (a  +  A)3  +  3 k2(a  +  A)  +  2k3. 

(b)  Show  that  /?(A)  =  0  when  X  =  —  a  — k.  Use  this  information  to  find  the  re¬ 
maining  two  roots  of  p(X). 
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(c)  Show  that  every  solution  u(t)  is  stable  if  2k  <  a,  and  unstable  if  2k  >  a. 

k  Suppose  in  Problem  2  that  the  z  nation  is  a  pacifist  nation,  while  x  and  y  are 
pugnacious  nations.  Then, 

^  =  —  ax  +  ky  +  kz  +  gx 

=  kx-ay  +  kz  +  g2  (*) 

^=0-x  +  0  -y-az  +  g3. 

Show  that  eveiy  solution  x(t),y(t),z(t)  of  (*)  is  stable  if  k<a,  and  unstable  if 
k>a. 

4.5.2  Lanchester’s  combat  models  and  the  battle  of  Iwo  Jima 

During  the  first  World  War,  F.  W.  Lanchester  [4]  pointed  out  the  impor¬ 
tance  of  the  concentration  of  troops  in  modern  combat.  He  constructed 
mathematical  models  from  which  the  expected  results  of  an  engagement 
could  be  obtained.  In  this  section  we  will  derive  two  of  these  models,  that 
of  a  conventional  force  versus  a  conventional  force,  and  that  of  a  conven¬ 
tional  force  versus  a  guerilla  force.  We  will  then  solve  these  models,  or 
equations,  and  derive  “Lanchester’s  square  law,”  which  states  that  the 
strength  of  a  combat  force  is  proportional  to  the  square  of  the  number  of 
combatants  entering  the  engagement.  Finally,  we  will  fit  one  of  these  mod¬ 
els,  with  astonishing  accuracy,  to  the  battle  of  Iwo  Jima  in  World  War  II. 

(a)  Construction  of  the  models 

Suppose  that  an  “x-force”  and  a  “>>-force”  are  engaged  in  combat.  For 
simplicity,  we  define  the  strengths  of  these  two  forces  as  their  number  of 
combatants.  (See  Howes  and  Thrall  [3]  for  another  definition  of  combat 
strength.)  Thus  let  x(t )  and^(/)  denote  the  number  of  combatants  of  the  x 
and  y  forces,  where  t  is  measured  in  days  from  the  start  of  the  combat. 
Clearly,  the  rate  of  change  of  each  of  these  quantities  equals  its  reinforce¬ 
ment  rate  minus  its  operational  loss  rate  minus  its  combat  loss  rate. 

The  operational  loss  rate.  The  operational  loss  rate  of  a  combat  force  is  its 
loss  rate  due  to  non-combat  mishaps;  i.e.,  desertions,  diseases,  etc. 
Lanchester  proposed  that  the  operational  loss  rate  of  a  combat  force  is 
proportional  to  its  strength.  However,  this  does  not  appear  to  be  very  real¬ 
istic.  For  example,  the  desertion  rate  in  a  combat  force  depends  on  a  host 
of  psychological  and  other  intangible  factors  which  are  difficult  even  to  de¬ 
scribe,  let  alone  quantify.  We  will  take  the  easy  way  out  here  and  consider 
only  those  engagements  in  which  the  operational  loss  rates  are  negligible. 

The  combat  loss  rate.  Suppose  that  the  x-force  is  a  conventional  force 
which  operates  in  the  open,  comparatively  speaking,  and  that  every  mem¬ 
ber  of  this  force  is  within  “kill  range”  of  the  enemy  y.  We  also  assume  that 
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as  soon  as  the  conventional  force  suffers  a  loss,  fire  is  concentrated  on  the 
remaining  combatants.  Under  these  “ideal”  conditions,  the  combat  loss 
rate  of  a  conventional  force  x  equals  ay(t),  for  some  positive  constant  a. 
This  constant  is  called  the  combat  effectiveness  coefficient  of  the  /-force. 

The  situation  is  very  different  if  x  is  a  guerilla  force,  invisible  to  its 
opponent  /  and  occupying  a  region  R.  The  /-force  fires  into  R  but  cannot 
know  when  a  kill  has  been  made.  It  is  certainly  plausible  that  the  combat 
loss  rate  for  a  guerilla  force  x  should  be  proportional  to  x{t\  for  the  larger 
x(t ),  the  greater  the  probability  that  an  opponent’s  shot  will  kill.  On  the 
other  hand,  the  combat  loss  rate  for  x  is  also  proportional  to/(/),  for  the 
larger/,  the  greater  the  number  of  ^-casualties.  Thus,  the  combat  loss  rate 
for  a  guerilla  force  x  equals  cx(t)y(t),  where  the  constant  c  is  called  the 
combat  effectiveness  coefficient  of  the  opponent/. 

The  reinforcement  rate .  The  reinforcement  rate  of  a  combat  force  is  the 
rate  at  which  new  combatants  enter  (or  are  withdrawn  from)  the  battle.  We 
denote  the  reinforcement  rates  of  the  jc-  and  /-forces  by  /(/)  and  g(t)  re¬ 
spectively. 

Under  the  assumptions  listed  above,  we  can  now  write  down  the  follow¬ 
ing  two  Lanchestrian  models  for  conventional-guerilla  combat. 


Conventional  combat: 


Conventional-guerilla  combat: 

(x  =  guerilla) 

The  system  of  equations  (la)  is  a  linear  system  and  can  be  solved  explicitly 
once  a ,  b,f(t ),  and  g(t)  are  known.  On  the  other  hand,  the  system  of  equa¬ 
tions  (lb)  is  nonlinear,  and  its  solution  is  much  more  difficult.  (Indeed,  it 
can  only  be  obtained  with  the  aid  of  a  digital  computer.) 

It  is  very  instructive  to  consider  the  special  case  where  the  reinforce¬ 
ment  rates  are  zero.  This  situation  occurs  when  the  two  forces  are  “iso¬ 
lated.”  In  this  case  (la)  and  (lb)  reduce  to  the  simpler  systems 


dx 

dt 

dy_ 

dt 


—  —  ay  +/(/) 

=  -foc+g(f) 


(la) 


dx 

dt 

dy_ 

dt 


=  —  cxy+f(t) 
=  -dx  +  g(t) 


(lb) 


and 


dx 

dt 


- ay , 


dx 

dt 


—  cxy9 


dy_ 

dt 


—  dx. 
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Conventional  combat :  The  square  law.  The  orbits  of  system  (2a)  are  the 
solution  curves  of  the  equation 


❖ 

dx 


bx 

ay 


<iy  . 

or  ay~r=bx. 
dx 


Integrating  this  equation  gives 

ay2  -  bx 2  =  ay\ -  bxl  =  K.  (3) 

The  curves  (3)  define  a  family  of  hyperbolas  in  the  x-y  plane  and  we  have 
indicated  their  graphs  in  Figure  1.  The  arrowheads  on  the  curves  indicate 
the  direction  of  changing  strengths  as  time  passes. 

Let  us  adopt  the  criterion  that  one  force  wins  the  battle  if  the  other 
force  vanishes  first.  Then,  y  wins  if  K  >  0  since  the  x-force  has  been 

annihilated  by  the  timey(/)  has  decreased  to  V K/  a  .  Similarly,  x  wins  if 

a:<o. 


Remark  1.  Equation  (3)  is  often  referred  to  as  “Lanchester’s  square  law,” 
and  the  system  (2a)  is  often  called  the  square  law  model,  since  the 
strengths  of  the  opposing  forces  appear  quadratically  in  (3).  This  terminol¬ 
ogy  is  rather  anomolous  since  the  system  (2a)  is  actually  a  linear  system. 

Remark  2.  They-force  always  seeks  to  establish  a  setting  in  which  ^>0. 
That  is  to  say,  the  >>-force  wants  the  inequality 

ayl  >  bxl 

to  hold.  This  can  be  accomplished  by  increasing  a ;  i.e.  by  using  stronger 
and  more  accurate  weapons,  or  by  increasing  the  initial  force  y0.  Notice 
though  that  a  doubling  of  a  results  in  a  doubling  of  ay\  while  a  doubling  of 
y0  results  in  a  four-fold  increase  of  ay\.  This  is  the  essence  of  Lanchester’s 
square  law  of  conventional  combat. 
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Conventional-guerilla  combat .  The  orbits  of  system  (2b)  are  the  solution 
curves  of  the  equation 

dy  =  dx  _  d_ 
dx  cxy  cy  * 

Multiplying  both  sides  of  (4)  by  cy  and  integrating  gives 

cy2  —  2dx  =  cy\  —  2  dx0  =  M.  (5) 

The  curves  (5)  define  a  family  of  parabolas  in  the  x—y  plane,  and  we  have 
indicated  their  graphs  in  Figure  2.  The  7-force  wins  if  M  >  0,  since  the  x- 

force  has  been  annihilated  by  the  time  y(t)  has  decreased  to  V M / c  .  Simi¬ 
larly,  x  wins  if  M  <  0. 


Remark.  It  is  usually  impossible  to  determine,  a  priori,  the  numerical  value 
of  the  combat  coefficients  a,  b,  c,  and  d.  Thus,  it  would  appear  that 
Lanchester’s  combat  models  have  little  or  no  applicability  to  real-life  en¬ 
gagements.  However,  this  is  not  so.  As  we  shall  soon  see,  it  is  often  possi¬ 
ble  to  determine  suitable  values  of  a  and  b  (or  c  and  d)  using  data  from  the 
battle  itself.  Once  these  values  are  established  for  one  engagement,  they  are 
known  for  all  other  engagements  which  are  fought  under  similar  condi¬ 
tions. 

(b)  The  battle  of  Iwo  Jima 

One  of  the  fiercest  battles  of  World  War  II  was  fought  on  the  island  of  Iwo 
Jima,  660  miles  south  of  Tokyo.  Our  forces  coveted  Iwo  Jima  as  a  bomber 
base  close  to  the  Japanese  mainland,  while  the  Japanese  needed  the  island 
as  a  base  for  fighter  planes  attacking  our  aircraft  on  their  way  to  bombing 
missions  over  Tokyo  and  other  major  Japanese  cities.  The  American  inva- 
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sion  of  Iwo  Jima  began  on  February  19,  1945,  and  the  fighting  was  intense 
throughout  the  month  long  combat.  Both  sides  suffered  heavy  casualties 
(see  Table  1).  The  Japanese  had  been  ordered  to  fight  to  the  last  man,  and 
this  is  exactly  what  they  did.  The  island  was  declared  “secure”  by  the 
American  forces  on  the  28th  day  of  the  battle,  and  all  active  combat 
ceased  on  the  36th  day.  (The  last  two  Japanese  holdouts  surrendered  in 
1951!) 


Table  1.  Casualties  at  Iwo  Jima 


Total  United  States  casualties  at  Iwo  Jima 

Killed,  missing  or  Wounded  Combat  Fatigue  Total 
died  of  wounds 


Marines 

Navy  units: 

5,931 

17,272 

2,648 

25,851 

Ships  and  air  units 

633 

1,158 

1,791 

Medical  corpsmen 

195 

529 

724 

Seabees 

51 

218 

269 

Doctors  and  dentists 

2 

12 

14 

Army  units  in  battle 

9 

28 

37 

Grand  totals 

6,821 

19,217 

2,648 

28,686 

Japanese  casualties  at  Iwo  Jima 

Defense  forces  Prisoners  Killed 

(Estimated) 

21,000  Marine  216  20,000 

Army  867 
Total  1,083 


(Newcomb  [6],  page  296) 


The  following  data  is  available  to  us  from  the  battle  of  Iwo  Jima. 

1.  Reinforcement  rates .  During  the  conflict  Japanese  troops  were 
neither  withdrawn  nor  reinforced.  The  Americans,  on  the  other  hand, 
landed  54,000  troops  on  the  first  day  of  the  battle,  none  on  the  second, 
6,000  on  the  third,  none  on  the  fourth  and  fifth,  13,000  on  the  sixth  day, 
and  none  thereafter.  There  were  no  American  troops  on  Iwo  Jima  prior  to 
the  start  of  the  engagement. 

2.  Combat  losses.  Captain  Clifford  Morehouse  of  the  United  States 
Marine  Corps  (see  Morehouse  [5])  kept  a  daily  count  of  all  American  com¬ 
bat  losses.  Unfortunately,  no  such  records  are  available  for  the  Japanese 
forces.  Most  probably,  the  casualty  lists  kept  by  General  Kuribayashi 
(commander  of  the  Japanese  forces  on  Iwo  Jima)  were  destroyed  in  the 
battle  itself,  while  whatever  records  were  kept  in  Tokyo  were  consumed  in 
the  fire  bombings  of  the  remaining  five  months  of  the  war.  However,  we 
can  infer  from  Table  1  that  approximately  21,500  Japanese  forces  were  on 
Iwo  Jima  at  the  start  of  the  battle.  (Actually,  Newcomb  arrived  at  the  fig- 
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ure  of  21,000  for  the  Japanese  forces,  but  this  is  a  little  low  since  he  appar¬ 
ently  did  not  include  some  of  the  living  and  dead  found  in  the  caves  in  the 
final  days.) 

3.  Operational  losses.  The  operational  losses  on  both  sides  were  negligi¬ 
ble. 

Now,  let  x(t)  and  y(t)  denote  respectively,  the  active  American  and 
Japanese  forces  on  Iwo  Jima  t  days  after  the  battle  began.  The  data  above 
suggests  the  following  Lanchestrian  model  for  the  battle  of  Iwo  Jima: 


dx 

dt 


-ay+f(t) 


dy_ 

dt 


=  —  bx 


where  a  and  b  are  the  combat  effectiveness  coefficients  of  the  Japanese 
and  American  forces,  respectively,  and 


r 


54,000 

0 

6,000 

0 

13,000 

0 


0<  t<  1 

1  <  f  <2 

2  <  /  <3 
3<t<5 
5<t<6 

t>  6 


Using  the  method  of  variation  of  parameters  developed  in  Section  3.12  or 
the  method  of  elimination  in  Section  2.14,  it  is  easily  seen  that  the  solution 
of  (6)  which  satisfies  jc(0)  =  0,  y(0)=yo= 21,500  is  given  by 


x 


and 


where 


(0=-  Vf  y0coshVabt  +  J  cosh Vab  (t  —  s)f(s)ds  (7a) 

y(t)=y0coshVab  t  —  sinh Vab  (t  —  s)f(s)ds  (7b) 


cosh;c=(e*  + e  *)/2  and  sinhx=(ex  —  e  x)/2. 


The  problem  before  us  now  is  this:  Do  there  exist  constants  a  and  b  so 
that  (7a)  yields  a  good  fit  to  the  data  compiled  by  Morehouse?  This  is  an 
extremely  important  question.  An  affirmative  answer  would  indicate  that 
Lanchestrian  models  do  indeed  describe  real  life  battles,  while  a  negative 
answer  would  shed  a  dubious  light  on  much  of  Lanchester’s  work. 

As  we  mentioned  previously,  it  is  extremely  difficult  to  compute  the 
combat  effectiveness  coefficients  a  and  b  of  two  opposing  forces.  However, 
it  is  often  possible  to  determine  suitable  values  of  a  and  b  once  the  data  for 
the  battle  is  known,  and  such  is  the  case  for  the  battle  of  Iwo  Jima. 
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The  calculation  of  a  and  b.  Integrating  the  second  equation  of  (6)  between 
0  and  s  gives 


so  that 


y(s)-y0=  -bf  x(t)dt 

J0 


yo~y(s) 


In  particular,  setting  s  =  36  gives 

t  y<>-y(M) 

b  = - 


21,500 


(8) 


(9) 


Now  the  integral  on  the  right-hand  side  of  (9)  can  be  approximated  by  the 
Riemann  sum 


and  for  x(i)  we  enter  the  number  of  effective  American  troops  on  the  ith 
day  of  the  battle.  Using  the  data  available  from  Morehouse,  we  compute 
for  b  the  value 


21,500 

2,037,000 


0.0106. 


Remark.  We  would  prefer  to  set  £  =  28  in  (8)  since  that  was  the  day  the  is¬ 
land  was  declared  secure,  and  the  fighting  was  only  sporadic  after  this  day. 
However,  we  don’t  know  y  (28).  Thus,  we  are  forced  here  to  take  £  =  36. 

Next,  we  integrate  the  first  equation  of  (6)  between  /  =  0  and  /  =  28  and 
obtain  that 


x 


=  —  a 


y(t)dt  +  73,000. 


There  were  52,735  effective  American  troops  on  the  28th  day  of  the  battle. 
Thus 


73,000-52,735  _  20,265 

r28  r2S 

I  y{t)dt  /  y(t)dt 
Jo  Jo 
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Finally,  we  approximate  the  integral  on  the  right-hand  side  of  (11)  by  the 
Riemann  sum 


and  we  approximate  y(j)  by 


y(j)=yo~b  fJx(t)dt 

J0 

j 

»21,500-&2*(0- 


/  =  / 


Again,  we  replace  *(/)  by  the  number  of  effective  American  troops  on  the 
/th  day  of  the  battle.  The  result  of  this  calculation  is  (see  Engel  [2]) 


20,265 

372,500 


0.0544. 


Figure  3  below  compares  the  actual  American  troop  strength  with  the  val¬ 
ues  predicted  by  Equation  (7a)  (with  a  =  0.0544  and  b  =  0.0106).  The  fit  is 
remarkably  good.  Thus,  it  appears  that  a  Lanchestrian  model  does  indeed 
describe  real  life  engagements. 


Figure  3.  Comparison  of  actual  troop  strength  with  predicted  troop  strength 


Remark.  The  figures  we  have  used  for  American  reinforcements  include 
all  the  personnel  put  ashore,  both  combat  troops  and  support  troops.  Thus 
the  numbers  a  and  b  that  we  have  computed  should  be  interpreted  as  the 
average  effectiveness  per  man  ashore. 
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Exercises 

1.  Derive  Equations  (7a)  and  (7b). 

2.  The  system  of  equations 


x  —  —  ay 
y=  -  by  -cxy 


is  a  Lanchestrian  model  for  conventional-guerilla  combat,  in  which  the  opera¬ 
tional  loss  rate  of  the  guerilla  forces  is  proportional  to >>(/). 

(a)  Find  the  orbits  of  (13). 

(b)  Who  wins  the  battle? 


3.  The  system  of  equations 


x—  —  ay 
y  —  —  bx  —  cxy 


(14) 


is  a  Lanchestrian  model  for  conventional-guerilla  combat,  in  which  the  opera¬ 
tional  loss  rate  of  the  guerilla  force  y  is  proportional  to  the  strength  of  the  con¬ 
ventional  force  x.  Find  the  orbits  of  (14). 


4.  The  system  of  equations 


x  =  —  cxy 
y  =  —  dxy 


is  a  Lanchestrian  model  for  guerilla-guerilla  combat  in  which  the  operational 
loss  rates  are  negligible. 

(a)  Find  the  orbits  of  (15). 

(b)  Who  wins  the  battle? 


5.  The  system  of  equations 


i  =  —  ay  —  cxy 
y  —  —  bx  —  dxy 


is  a  Lanchestrian  model  for  guerilla-guerilla  combat  in  which  the  operational 
loss  rate  of  each  force  is  proportional  to  the  strength  of  its  opponent.  Find  the 
orbits  of  (16). 
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6.  The  system  of  equations 

x=  —  ax  —  cxy 
y=  —  by  —  dxy 

is  a  Lanchestrian  model  for  guerilla-guerilla  combat  in  which  the  operational 
loss  rate  of  each  force  is  proportional  to  its  strength. 

(a)  Find  the  orbits  of  (17). 

(b)  Show  that  the  x  andy  axes  are  both  orbits  of  (17). 

(c)  Using  the  fact  (to  be  proved  in  Section  4.6)  that  two  orbits  of  (17)  cannot 
intersect,  show  that  there  is  no  clear-cut  winner  in  this  battle.  Hint :  Show  that 
x{t)  an dy(/)  can  never  become  zero  in  finite  time.  (Using  lemmas  1  and  2  of 
Section  4.8,  it  is  easy  to  show  that  both  x(t)  andy(/)  approach  zero  as  t-* oo.) 


4.6  Qualitative  properties  of  orbits 

In  this  section  we  will  derive  two  very  important  properties  of  the  solutions 
and  orbits  of  the  system  of  differential  equations 


x=f(x),  x  = 

• 

• 

• 

>  f(x)  = 

f\  (-^1>  *  *  *  >-*71 ) 

• 

• 

• 

W 

fn  C*l>  *  *  *  >Xn) 

The  first  property  deals  with  the  existence  and  uniqueness  of  orbits,  and 
the  second  property  deals  with  the  existence  of  periodic  solutions  of  (1). 
We  begin  with  the  following  existence-uniqueness  theorem  for  the  solu¬ 
tions  of  (1). 

Theorem  3.  Let  each  of  the  functions  fx(xx, . . .  ,x„), . . .  ,fn(xl9 . . . ,  xn)  have  con¬ 
tinuous  partial  derivatives  with  respect  to  xl9 . .  .,x„.  Then ,  the  initial-value 
problem  x  =  f(x),  x(/0)  =  x°  has  one ,  and  only  one  solution  x  =  x(/),  for  ev¬ 
ery  x°  in  R". 

We  prove  Theorem  3  in  exactly  the  same  manner  as  we  proved  the  ex¬ 
istence-uniqueness  theorem  for  the  scalar  differential  equation  x=/(/,x). 
Indeed,  the  proof  given  in  Section  1.10  carries  over  here  word  for  word. 
We  need  only  interpret  the  quantity  |x(Q  — y(/)|,  where  x(/)  and  y(/)  are 
vector-valued  functions,  as  the  length  of  the  vector  x(/)-y(/).  That  is  to 
say,  if  we  interpret  |x(Q  —  y(/)|  as 

WO  ■ —  yCOI =max{  |x,  (/)  -7,  (01,  •  •  • ,  k(0  -y„(t)\ }> 

then  the  proof  of  Theorem  2,  Section  1.10  is  valid  even  for  vector- valued 
functions  f(f,x)  (see  Exercises  13-14). 

Next,  we  require  the  following  simple  but  extremely  useful  lemma. 

Lemma  1.  Ifx=<j>(t)  is  a  solution  of  (1),  then  x  =  <j>(t  +  c)  is  again  a  solution 

of  (If 
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The  meaning  of  Lemma  1  is  the  following.  Let  x=<j>(t)  be  a  solution  of 
(1)  and  let  us  replace  every  t  in  the  formula  for  ^(/)  by  /4-c.  In  this 
manner  we  obtain  a  new  function  x(/)=^(/  +  c).  Lemma  1  states  that  x(7) 
is  again  a  solution  of  (1).  For  example,  xx  =  tan/,  x2  =  sec 2t  is  a  solution  of 
the  system  of  differential  equations  dxx/ dt  =  jc2,  dx2/ dt  =  2xxx2.  Hence, 
Xj  =  tan(f  +  c),  x2  =  sec 2(/  +  c)  is  again  a  solution,  for  any  constant  c. 

Proof  of  lemma  1.  If  x=<£(/)  is  a  solution  of  (1),  then  d<j>(t)/dt=f(<l>(t)); 
that  is,  the  two  functions  d$(t)/dt  and  h(7)=f(^(/))  agree  at  every  single 
time.  Fix  a  time  t  and  a  constant  c.  Since  d<j>/ dt  and  h  agree  at  every  time, 
they  must  agree  at  time  1 4-  c.  Hence, 

4-  c)  =  h(y  4-  c)  =  4-  c)). 

But,  d$/dt  evaluated  at  t  +  c  equals  the  derivative  of  x(/)  =  ^(/  +  c), 
evaluated  at  t.  Therefore, 

+  c)  =  +  c))-  □ 


Remark  1.  Lemma  1  can  be  verified  explicitly  for  the  linear  equation  x  = 
Ax.  Every  solution  x(7)  of  this  equation  is  of  the  form  x(/)  =  eA'v,  for  some 
constant  vector  v.  Hence, 

x(  t  +  c)  =  e + c)v  =  eXteXcy 

since  (A/)Ac  =  Ac(AQ  for  all  values  of  t  and  c.  Therefore,  x(/  +  c)  is  again 
a  solution  of  x  =  Ax  since  it  is  of  the  form  eXt  times  the  constant  vector 
eXc\. 

Remark  2.  Lemma  1  is  not  true  if  the  function  f  in  (1)  depends  explicitly 
on  t.  To  see  this,  suppose  that  x  =  <£(7)  is  a  solution  of  the  nonautonomous 
differential  equation  x  =  i(t ,  x).  Then,  <$>(t  +  c)  =  f(/  4-  c ,  ${t  4-  c)).  Conse¬ 
quently,  the  function  x  =  <£(74-c)  satisfies  the  differential  equation 

x  =  f(/4-c,x), 

and  this  equation  is  different  from  (1)  if  f  depends  explicitly  on  t. 

We  are  now  in  a  position  to  derive  the  following  extremely  important 
properties  of  the  solutions  and  orbits  of  (1). 

Property  1.  (Existence  and  uniqueness  of  orbits.)  Let  each  of  the  functions 
fx(xl9 . . . , xn ), . . .  Jn (xx, . . . , xn)  have  continuous  partial  derivatives  with  re¬ 
spect  to  xx,...,xn.  Then,  there  exists  one,  and  only  one,  orbit  through 
every  point  x°  in  R".  In  particular,  if  the  orbits  of  two  solutions  x  =  <£(0 
and  x  =  of  (1)  have  one  point  in  common,  then  they  must  be  identi¬ 

cal. 
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Property  2.  Let  x  =  <£(/)  be  a  solution  of  (1).  If  <£(/0  +  r)=$(t0)  for  some  t0 
and  r>0,  then  ^(7  +  T )  is  identically  equal  to  In  other  words,  if  a 
solution  x(7)  of  (1)  returns  to  its  starting  value  after  a  time  T  >0,  then  it 
must  be  periodic,  with  period  T  (i.e.  it  must  repeat  itself  over  every  time 
interval  of  length  T.) 

Proof  of  property  1.  Let  x°  be  any  point  in  the  ^-dimensional  phase 
space  xl9...9xn9  and  let  x  =  <£(7)  be  the  solution  of  the  initial-value  problem 
x  =  f(x),  x(0)  =  x°.  The  orbit  of  this  solution  obviously  passes  through  x°. 
Hence,  there  exists  at  least  one  orbit  through  every  point  x°.  Now,  suppose 
that  the  orbit  of  another  solution  x  =  \p(t)  also  passes  through  x°.  This 
means  that  there  exists  t0(¥=0)  such  that  ^(70)  =  x°.  By  Lemma  1, 

x  =  iK'  +  'o) 

is  also  a  solution  of  (1).  Observe  that  $(t  +  t0)  and  <£(/)  have  the  same 
value  at  /  =  0.  Hence,  by  Theorem  3,  ^(7  +  f0)  ecluals  f°r  ti me  *• 

This  implies  that  the  orbits  of  <£(/)  and  \p(t)  are  identical.  To  wit,  if  £  is  a 
point  on  the  orbit  of  <£(7);  that  is,  £=<£(7i)  for  some  tl9  then  £  is  also  on  the 
orbit  of  )p(t),  since  £=<f>(fi)  =  >Kfi  +  *o)-  Conversely,  if  £  is  a  point  on  the 
orbit  of  i^(7);  that  is,  there  exists  t2  such  that  yfr(t2)=&  then  £  is  also  on  the 
orbit  of  since  £=  ^(72)  =  ~  *o)-  □ 

Proof  of  property  2.  Let  x  =  <£(/)  be  a  solution  of  (1)  and  suppose  that 
<£(70+  T)  =  <£(70)  for  some  numbers  t0  and  T.  Then,  the  function  ^(0  = 


$(t+T)  is  also  a  solution  of  (1)  which  agrees  with  at  time  t  =  t0 .  By 
Theorem  3,  therefore,  i//(7)  =  <£(7  +  T)  is  identically  equal  to  $(t).  □ 

Property  2  is  extremely  useful  in  applications,  especially  when  n  =  2.  Let 
x  =  x(t),  y=y(t)  be  a  periodic  solution  of  the  system  of  differential  equa¬ 
tions 

(2) 


If  x(t+  T)  =  x(t)  and  y(t+  T)=y(t%  then  the  orbit  of  this  solution  is  a 
closed  curve  C  in  the  x-y  plane.  In  every  time  interval  t0<  t  <  t0+  T,  the 
solution  moves  once  around  C.  Conversely,  suppose  that  the  orbit  of  a 
solution  x  =  x(t),  y  =y{t)  of  (2)  is  a  closed  curve  containing  no  equilibrium 
points  of  (2).  Then,  the  solution  x  =  x(t\y  =y(t)  is  periodic.  To  prove  this, 
recall  that  a  solution  x  =  x(7),  y=y(t)  of  (2)  moves  along  its  orbit  with 
velocity  [f\x9y)  +  g\x9y)]x/1.  If  its  orbit  C  is  a  closed  curve  containing  no 
equilibrium  points  of  (2),  then  the  function  [f2(x,y)  +  g2(x,y)]1/2  has  a 
positive  minimum  for  (x,y)  on  C.  Hence,  the  orbit  of  x  =  x  (0>y-y(0 
must  return  to  its  starting  point  x0  =  x(t0)9y0:=y(to)  in  some  finite  time  T. 
But  this  implies  that  x(t+  T)  =  x(t)  and y(t+  T)=y(t )  for  all  t. 

Example  1.  Prove  that  every  solution  z(7)  of  the  second-order  differential 
equation  ( d2z / dt 2)  +  z  +  z3  =  0is  periodic. 
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Proof.  We  convert  this  second-order  equation  into  a  system  of  two  first- 
order  equations  by  setting  x  —  z,y  =  dz/dt.  Then, 

dx  _  dy 

dt  dt 

The  orbits  of  (3)  are  the  solution  curves 


—  —  x  —  x 


(3) 


y  ,  x2  x4  2 

T4TfT  =  c 


(4) 


of  the  scalar  equation  dy  I  dx  —  —  (x  +  x3)/y.  Equation  (4)  defines  a  closed 
curve  in  the  x-y  plane  (see  Exercise  7).  Moreover,  the  only  equilibrium 
point  of  (3)  is  x  =  0,y  =0.  Consequently,  every  solution  x  =  z(t),y  =  z'{t)  of 
(3)  is  a  periodic  function  of  time.  Notice,  however,  that  we  cannot  compute 
the  period  of  any  particular  solution.  □ 


Example  2.  Prove  that  every  solution  of  the  system  of  differential  equa¬ 
tions 


dx 

dt 


=ye 


1  +x  2+y2 

9 


dt 


=  -xel+x2+y2 


is  periodic. 

Solution.  The  orbits  of  (5)  are  the  solution  curves  x2+y2  =  c2  of  the  first- 
order  scalar  equation  dy  /  dx  —  -  x/y.  Moreover,  x  =  0,  y  =  0  is  the  only 
equilibrium  point  of  (5).  Consequently,  every  solution  x=x(0,y=y(t)  of 
(5)  is  a  periodic  function  of  time. 


Exercises 


1.  Show  that  all  solutions  x(t),y(t)  of 

dx  2  ,  dy 

-3F=x+ysmx,  ^ 


—  1  +  xy  4-  cosy 


which  start  in  the  first  quadrant  (x  >  0,  y  >  0)  must  remain  there  for  all  time 
(both  backwards  and  forwards). 


2.  Show  that  all  solutions  x(t\  y(t)  of 

t  -*«x-  '>• 


dy 

dt 


=  x  +  ey 


which  start  in  the  right  half  plane  (x  >  0)  must  remain  there  for  all  time. 

3.  Show  that  all  solutions  x{t),y{t)  of 

$  =  1 +x2+?2’  %=xy  +  XBny 

which  start  in  the  upper  half  plane  ( y  >  0)  must  remain  there  for  all  time. 

4.  Show  that  all  solutions  x(t),y(t)  of 


dx 


=  -l-y  +  x2. 


dy 


=  x  +  xy 


dt  '  ’  dt 

which  start  inside  the  unit  circle  x2  +y2  =  1  must  remain  there  for  all  time. 
Hint :  Compute  d(x2+y2)/dt. 
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5.  Let  x be  a  solution  of 

dx 


dt 


—y  +  x' 


dy 

dt 


=  x+y‘ 


with  x(t0)^y(t0).  Show  that  x(t)  can  never  equal  y(t). 

6.  Can  a  figure  8  ever  be  an  orbit  of 

wmf{x>y)t  %~g{x’y) 

where  /  and  g  have  continuous  partial  derivatives  with  respect  to  x  and  yl 

7.  Show  that  the  curve  y2+ x2  +  x4/2  =  2c2  is  closed.  Hint :  Show  that  there  exist 
two  points  y  =  0,  x  =  ±  a  which  lie  on  this  curve. 

Prove  that  all  solutions  of  the  following  second-order  equations  are  peri¬ 
odic. 


8.  ^+z3  =  0 
dt 2 

10.  ^+e*2=l 

dt 2 

12.  Show  that  all  solutions  z(t)  of 

d2z 
dt 2 

:2/n\  i  _2/r\\  -4 1 


9.  +z  +  z5= o 

dt 2 

11.4  + 

dt 2 


1  +  z 


=  0 


+  z  —  2z3=0 


are  periodic  if  i2(0)  +  z2(0)  —  z4(0)  <  and  unbounded  if 

i2(  0)  +  z2(0)  -  z4(0)  >^ . 


13.  (a)  Let 

L  =  nX  max  |3/J/0jcy-|,  for  |x  —  x°|<&. 

Show  that  |f(x)  —  f(y)|  <  L|x-y|  if  |x— x°|  <  b  and  |y — x°|  <  b. 

(b)  Let  Af =max|f(x)|  for  |x-x°|  <  b.  Show  that  the  Picard  iterates 

X/+i(0  =  x°+  rf(x/j))A,  xo(0=x° 

Jh 

converge  to  a  solution  x(7)  of  the  initial-value  problem  x=f(x),  x(/0)=x°  on 
the  interval  \t  —  /0|  <  b/M.  Hint :  The  proof  of  Theorem  2,  Section  1.10 
carries  over  here  word  for  word. 

14.  Compute  the  Picard  iterates  Xj(t)  of  the  initial-value  problem  x= Ax,  x(0) = x°, 
and  verify  that  they  approach  eAtx°  as  j  approaches  infinity. 


4.7  Phase  portraits  of  linear  systems 


In  this  section  we  present  a  complete  picture  of  all  orbits  of  the  linear  dif¬ 
ferential  equation 


*-(£)'  A~{“  *)■  (1) 
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This  picture  is  called  a  phase  portrait,  and  it  depends  almost  completely  on 
the  eigenvalues  of  the  matrix  A.  It  also  changes  drastically  as  the  eigenval¬ 
ues  of  A  change  sign  or  become  imaginary. 

When  analyzing  Equation  (1),  it  is  often  helpful  to  visualize  a  vector 


in  R2  as  a  direction,  or  directed  line  segment,  in  the  plane.  Let 


be  a  vector  in  R2  and  draw  the  directed  line  segment  x  from  the  point  (0,0) 
to  the  point  (xl,x2),  as  in  Figure  la.  This  directed  line  segment  is  parallel 
to  the  line  through  (0,0)  with  direction  numbers  xl9x2  respectively.  If  we 
visualize  the  vector  x  as  being  this  directed  line  segment  x,  then  we  see  that 
the  vectors  x  and  cx  are  parallel  if  c  is  positive,  and  antiparallel  if  c  is 
negative.  We  can  also  give  a  nice  geometric  interpretation  of  vector  addi¬ 
tion.  Let  x  and  y  be  two  vectors  in  R2.  Draw  the  directed  line  segment  x, 
and  place  the  vector  /  at  the  tip  of  x .  The  vector  x  +/  is  then  the  composi- 


Figure  2 
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tion  of  these  two  directed  line  segments  (see  Figure  2).  This  construction  is 
known  as  the  parallelogram  law  of  vector  addition. 

We  are  now  in  a  position  to  derive  the  phase  portraits  of  (1).  Let  Aj  and 
X2  denote  the  two  eigenvalues  of  A.  We  distinguish  the  following  cases. 

1.  X2<Xj  <0.  Let  v1  and  v2  be  eigenvectors  of  A  with  eigenvalues  Xx  and 
X2  respectively.  In  the  xl  —  x2  plane  we  draw  the  four  half-lines  lx,  /(,  /2,  and 
/2,  as  shown  in  Figure  3.  The  rays  lx  and  l2  are  parallel  to  v1  and  v2,  while 
the  rays  l[  and  l2  are  parallel  to  —  v1  and  —  v2.  Observe  first  that  x(/)  = 
ceXxtyx  is  a  solution  of  (1)  for  any  constant  c.  This  solution  is  always  pro¬ 
portional  to  v1,  and  the  constant  of  proportionality,  ceXx\  runs  from  ±  oo  to 
0,  depending  as  to  whether  c  is  positive  or  negative.  Hence,  the  orbit  of  this 
solution  is  the  half-line  lx  for  c  >  0,  and  the  half-line  l[  for  c  <  0.  Similarly, 
the  orbit  of  the  solution  x(t)  =  ceXlty2  is  the  half-line  l2  for  c>0,  and  the 
half-line  l2  for  c  <  0.  The  arrows  on  these  four  lines  in  Figure  3  indicate  in 
what  direction  x(Q  moves  along  its  orbit. 

Next,  recall  that  every  solution  x(/)  of  (1)  can  be  written  in  the  form 

x(0  =  c1ex>'v1  +  c2eXi'v2  (2) 

for  some  choice  of  constants  cx  and  c2.  Obviously,  every  solution  x(/)  of  (1) 
approaches  (£)  as  t  approaches  infinity.  Hence,  every  orbit  of  (1)  ap¬ 
proaches  the  origin  x1  =  jc2  =  0  as  t  approaches  infinity.  We  can  make  an 
even  stronger  statement  by  observing  that  eX2tv2  is  very  small  compared  to 
eXl*\l  when  /  is  very  large.  Therefore,  x(t),  for  cx=£  0,  comes  closer  and 
closer  to  cxeXltyx  as  /  approaches  infinity.  This  implies  that  the  tangent  to 
the  orbit  of  x(r)  approaches  lx  if  cx  is  positive,  and  l[  if  cx  is  negative.  Thus, 
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the  phase  portrait  of  (1)  has  the  form  described  in  Figure  3.  The  dis¬ 
tinguishing  feature  of  this  phase  portrait  is  that  every  orbit,  with  the  excep¬ 
tion  of  a  single  line,  approaches  the  origin  in  a  fixed  direction  (if  we  con¬ 
sider  the  directions  v1  and  —  v1  equivalent).  In  this  case  we  say  that  the 
equilibrium  solution  x  =  0  of  (1)  is  a  stable  node. 

Remark.  The  orbit  of  every  solution  x(t)  of  (1)  approaches  the  origin  xx  = 
x2  =  0  as  t  approaches  infinity.  However,  this  point  does  not  belong  to  the 
orbit  of  any  nontrivial  solution  x(/). 

1'.  0<Xx  <\2.  The  phase  portrait  of  (1)  in  this  case  is  exactly  the  same 
as  Figure  3,  except  that  the  direction  of  the  arrows  is  reversed.  Hence,  the 
equilibrium  solution  x(/)  =  0  of  (1)  is  an  unstable  node  if  both  eigenvalues 
of  A  are  positive. 

2.  A,=A2<0.  In  this  case,  the  phase  portrait  of  (1)  depends  on  whether 
A  has  one  or  two  linearly  independent  eigenvectors,  (a)  Suppose  that  A  has 
two  linearly  independent  eigenvectors  v1  and  v2  with  eigenvalue  A<0.  In 
this  case,  every  solution  x(/)  of  (1)  can  be  written  in  the  form 

x(/)  =  cxeXtyx  4-  c2eXty2  =  eXt  (qv1  +  c2v2)  (2) 

for  some  choice  of  constants  c,  and  c2.  Now,  the  vector  eXt(cxyl  +  c2y2)  is 
parallel  to  cxyx  -I-  c2y2  for  all  t.  Hence,  the  orbit  of  every  solution  x(/)  of  (1) 
is  a  half-line.  Moreover,  the  set  of  vectors  (qv1  +  c2v2},  for  all  choices  of  cx 
and  c2,  cover  every  direction  in  the  xx  —  x2  plane,  since  v1  and  v2  are  lin¬ 
early  independent.  Hence,  the  phase  portrait  of  (1)  has  the  form  described 
in  Figure  4a.  (b)  Suppose  that  A  has  only  one  linearly  independent  eigen¬ 
vector  v,  with  eigenvalue  A.  In  this  case,  x1(/)  =  eA/v  is  one  solution  of  (1). 
To  find  a  second  solution  of  (1)  which  is  independent  of  x1,  we  observe 
that  (A  —  AI)2u=0  for  every  vector  u.  Hence, 

x(  t)  =  eAtu  =  eXte^A  ~  AI)'u  =  eXt  [  u  + 1  ( A  —  AI)u  ]  (3) 

is  a  solution  of  (1)  for  any  choice  of  u.  Equation  (3)  can  be  simplified  by 
observing  that  (A  — AI)u  must  be  a  multiple  k  of  v.  This  follows  im¬ 
mediately  from  the  equation  (A  —  AI)[(A  —  AI)u]  =  0,  and  the  fact  that  A  has 
only  one  linearly  independent  eigenvector  v.  Choosing  u  independent  of  v, 
we  see  that  every  solution  x(f)  of  (1)  can  be  written  in  the  form 

x(f)  =  cxeXty  +  c2eXt  (u  4-  kty)  =  eXt  (qv  +  c2u 4*  c2kty ),  (4) 

for  some  choice  of  constants  cx  and  c2.  Obviously,  every  solution  x(/)  of  (1) 
approaches  ($)  as  t  approaches  infinity.  In  addition,  observe  that  c1v4-c2u 
is  very  small  compared  to  c2kty  if  c2  is  unequal  to  zero  and  t  is  very  large. 
Hence,  the  tangent  to  the  orbit  of  x(f)  approaches  ±v  (depending  on  the 
sign  of  c2 )  as  t  approaches  infinity,  and  the  phase  portrait  of  (1)  has  the 
form  described  in  Figure  4b. 
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Figure  4 


2'.  A1=A2>0.  The  phase  portraits  of  (1)  in  the  cases  (2a)'  and  (2b)'  are 
exactly  the  same  as  Figures  4a  and  4b,  except  that  the  direction  of  the 
arrows  is  reversed. 

3.  \x  <0<\2.  Let  v1  and  v2  be  eigenvectors  of  A  with  eigenvalues  Xx  and 
X2  respectively.  In  the  xx  —  x2  plane  we  draw  the  four  half-lines  lv  l[,  l2 ,  and 
l2 ;  the  half-lines  lx  and  l2  are  parallel  to  v1  and  v2,  while  the  half-lines  /{ 
and  l2  are  parallel  to  —  v1  and  —  v2.  Observe  first  that  every  solution  x(Q  of 
(1)  is  of  the  form 


x(/)  =  clex>V  +  c2^'v2  (5) 

for  some  choice  of  constants  cx  and  c2.  The  orbit  of  the  solution  x(/)  = 
cxeX{tyl  is  lx  for  cx  >0  and  l[  for  cx  <0,  while  the  orbit  of  the  solution 
x(0  =  c2eXlty2  is  l2  for  c2>0  and  l2  for  c2<0.  Note,  too,  the  direction  of  the 
arrows  on  lx,  /(,  /2,  and  /2;  the  solution  x(t)  =  cxeXltyl  approaches  (£)  as  t 
approaches  infinity,  whereas  the  solution  x(Q  =  c2eX2*y2  becomes  un¬ 
bounded  (for  c2^ 0)  as  t  approaches  infinity.  Next,  observe  that  eA,V  is 
very  small  compared  to  eXlty2  when  /  is  very  large.  Hence,  every  solution 
x(/)  of  (1)  with  c2 7^0  becomes  unbounded  as  t  approaches  infinity,  and  its 
orbit  approaches  either  l2  or  /2.  Finally,  observe  that  eXlty2  is  very  small 
compared  to  eA,V  when  t  is  very  large  negative.  Hence,  the  orbit  of  any 
solution  x(0  of  (1),  with  cx¥=  0,  approaches  either  lx  or  l[  as  t  approaches 
minus  infinity.  Consequently,  the  phase  portrait  of  (1)  has  the  form  de¬ 
scribed  in  Figure  5.  This  phase  portrait  resembles  a  “saddle”  near  jcj  =  jc2  = 
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Figure  5.  Phase  portrait  of  a  saddle  point 


0.  For  this  reason,  we  say  that  the  equilibrium  solution  x(/)  =  0  of  (1)  is  a 
saddle  point  if  the  eigenvalues  of  A  have  opposite  sign. 

4.  Aj  =  a  +  =  a  —  i/3,/$=£0.  Our  first  step  in  deriving  the  phase 

portrait  of  (1)  is  to  find  the  general  solution  of  (1).  Let  z  =  u  +  /v  be  an  ei¬ 
genvector  of  A  with  eigenvalue  a  +  //?.  Then, 

x(/)  =  e(<*+,0)'(u  +  /v)  =  eat  (cos fit  +  i sin/?/)(u  +  /v) 

=  ea'[ucos/fr — vsin/fr]  +  /eot'[usin/fr  +  vcos/?/] 
is  a  complex-valued  solution  of  (1).  Therefore, 

x1(/)  =  ^a/[ucos^  —  vsin/?/] 
and 


x2(  t )  =  e  at  [  u  sin  fit  +  v  cos  fit  ] 

are  two  real-valued  linearly  independent  solutions  of  (1),  and  every  solu¬ 
tion  x(f)  of  (1)  is  of  the  form  x(/)  =  c1x1(/)  +  c2x2(0.  This  expression  can  be 
written  in  the  form  (see  Exercise  15) 


Rxco§{flt-8x ) 
^2cos(  flt  —  82) 


for  some  choice  of  constants  Rx  >0,  R2>0,  Sl9  and  §2.  We  distinguish  the 
following  cases. 

(a)  a  =  0:  Observe  that  both 

x, (/)  =  Rx cos(  fit  —  8X )  and  x2(/)  =  /*2cos(  fit  —  82) 

are  periodic  functions  of  time  with  period  2tt//3 .  The  function  xx{t)  varies 
between  —  Rx  and  +  Rx,  while  x2(/)  varies  between  —  R2  and  +  R2 .  Conse- 
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quently,  the  orbit  of  any  solution  x(/)  of  (1)  is  a  closed  curve  surrounding 
the  origin  xx  =  x2  =  0,  and  the  phase  portrait  of  (1)  has  the  form  described 
in  Figure  6a.  For  this  reason,  we  say  that  the  equilibrium  solution  x(/)=0 
of  (1)  is  a  center  when  the  eigenvalues  of  A  are  pure  imaginary. 

The  direction  of  the  arrows  in  Figure  6a  must  be  determined  from  the 
differential  equation  (1).  The  simplest  way  of  doing  this  is  to  check  the  sign 
of  x2  when  x2=0.  If  x2  is  greater  than  zero  for  x2  =  0  and  xx  >0,  then  all 
solutions  x(/)  of  (1)  move  in  the  counterclockwise  direction;  if  x2  is  less 
than  zero  for  x2  =  0  and  .XjX),  then  all  solutions  x(/)  of  (1)  move  in  the 
clockwise  direction. 


4.7  Phase  portraits  of  linear  systems 


(b)  a<0:  In  this  case,  the  effect  of  the  factor  eat  in  Equation  (6)  is  to 
change  the  simple  closed  curves  of  Figure  6a  into  the  spirals  of  Figure  6b. 
This  is  because  the  point  x(2i t/ P)  =  e27TOi^x(0)  is  closer  to  the  origin  than 
x(0).  Again,  the  direction  of  the  arrows  in  Figure  6b  must  be  determined 
directly  from  the  differential  equation  (1).  In  this  case,  we  say  that  the 
equilibrium  solution  x(/)=0  of  (1)  is  a  stable  focus. 

(c)  a>0:  In  this  case,  all  orbits  of  (1)  spiral  away  from  the  origin  as  t 
approaches  infinity  (see  Figure  6c),  and  the  equilibrium  solution  x(t)  = 0  of 
(1)  is  called  an  unstable  focus. 

Finally,  we  mention  that  the  phase  portraits  of  nonlinear  systems,  in  the 
neighborhood  of  an  equilibrium  point,  are  often  very  similar  to  the  phase 
portraits  of  linear  systems.  More  precisely,  let  x  =  x°  be  an  equilibrium 
solution  of  the  nonlinear  equation  x  =  f(x),  and  set  u  =  x  — x°.  Then,  (see 
Section  4.3)  we  can  write  the  differential  equation  x  =  f(x)  in  the  form 

u  =  Au+g(u)  (7) 

where  A  is  a  constant  matrix  and  g(u)  is  very  small  compared  to  u.  We 
state  without  proof  the  following  theorem. 

Theorem  4.  Suppose  that  u=0  is  either  a  node ,  saddle ,  or  focus  point  of  the 
differential  equation  u  =  Au.  Then ,  the  phase  portrait  of  the  differential 
equation  x  =  f(x),  in  a  neighborhood  of  x  =  x°,  has  one  of  the  forms  de¬ 
scribed  in  Figures  3,  5,  and  6  (b  and  c),  depending  as  to  whether  u=0wfl 
node ,  saddle ,  or  focus. 

Example  1.  Draw  the  phase  portrait  of  the  linear  equation 

x=Ax  =  (  “Ijx.  (8) 

Solution.  It  is  easily  verified  that 

v'-(|)  and  v’-(J) 

are  eigenvectors  of  A  with  eigenvalues  -3  and  —6,  respectively.  There¬ 
fore,  x  =  0  is  a  stable  node  of  (8),  and  the  phase  portrait  of  (8)  has  the  form 
described  in  Figure  7.  The  half-line  lx  makes  an  angle  of  45°  with  the  xx 
axis,  while  the  half-line  l2  makes  an  angle  of  0  degrees  with  the  xraxis, 
where  tan  0  =  4. 

Example  2.  Draw  the  phase  portrait  of  the  linear  equation 

*=Ax=(_J  _5)x-  w 

Solution.  It  is  easily  verified  that 

,'-(])  and 
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Figure  7.  Phase  portrait  of  (8) 


Figure  8.  Phase  portrait  of  (9) 


are  eigenvectors  of  A  with  eigenvalues  -  2  and  4,  respectively.  Therefore, 
x  =  0  is  a  saddle  point  of  (9),  and  its  phase  portrait  has  the  form  described 
in  Figure  8.  The  half-line  lx  makes  an  angle  of  45°  with  the  Xj-axis,  and  the 
half-line  l2  is  at  right  angles  to  lv 

Example  3.  Draw  the  phase  portrait  of  the  linear  equation 

±-Ax-(:j  _|)x.  (10) 


426 


4.7  Phase  portraits  of  linear  systems 


Solution .  The  eigenvalues  of  A  are  —  1  ±  i.  Hence,  x  =  0  is  a  stable  focus  of 
(10)  and  every  nontrivial  orbit  of  (10)  spirals  into  the  origin  as  t  ap¬ 
proaches  infinity.  To  determine  the  direction  of  rotation  of  the  spiral,  we 
observe  that  x2  =  —  xx  when  x2  =  0.  Thus,  x2  is  negative  for  xx  >0  and  x2  = 
0.  Consequently,  all  nontrivial  orbits  of  (10)  spiral  into  the  origin  in  the 
clockwise  direction,  as  shown  in  Figure  9. 


Exercises 

Draw  the  phase  portraits  of  each  of  the  following  systems  of  differential 
equations. 


10.  Show  that  every  orbit  of 


is  an  ellipse. 

11.  The  equation  of  motion  of  a  spring-mass  system  with  damping  (see  Section  2.6) 
is  mz  +  cz  +  kz  =  0,  where  m,  c,  and  k  are  positive  numbers.  Convert  this  equa¬ 
tion  to  a  system  of  first-order  equations  for  x  =  z,  y  —  z,  and  draw  the  phase 
portrait  of  this  system.  Distinguish  the  overdamped,  critically  damped,  and  un¬ 
derdamped  cases. 

12.  Suppose  that  a  2  X  2  matrix  A  has  2  linearly  independent  eigenvectors  with  ei¬ 
genvalue  X.  Show  that  A=AI. 
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13.  This  problem  illustrates  Theorem  4.  Consider  the  system 


dx 

dt 


=y> 


dy 

dt 


=  x  +  2x3. 


(a)  Show  that  the  equilibrium  solution  x=*0,y  =  0  of  the  linearized  system  x  — 
y,  y  =  x  is  a  saddle,  and  draw  the  phase  portrait  of  the  linearized  system. 

(b)  Find  the  orbits  of  (*),  and  then  draw  its  phase  portrait. 

(c)  Show  that  there  are  exactly  two  orbits  of  (*)  (one  for  x  >  0  and  one  for  x  < 
0)  on  which  x-»0,  as  t-±  oo.  Similarly,  there  are  exactly  two  orbits  of 
(*)  on  which  x->0,  >>-»0  as  t-+—  oo.  Thus,  observe  that  the  phase  portraits 
of  (*)  and  the  linearized  system  look  the  same  near  the  origin. 


14.  Verify  Equation  (6).  Hint :  The  expression  a  cos  cot  +  b  sin  cot  can  always  be 
written  in  the  form  Rcos(ut  -  8)  for  suitable  choices  of  R  and  8. 


4.8  Long  time  behavior  of  solutions; 
the  Poincare-Bendixson  Theorem 

We  consider  now  the  problem  of  determining  the  long  time  behavior  of  all 
solutions  of  the  differential  equation 


x=f(x),  x  = 

*r 

• 

• 

• 

>  f(x)  = 

f\  (Xj, . . .  ,xn) 

• 

• 

• 

t - 

=,* 

v. _ 

f - 

• 

• 

• 

£ 

V 

This  problem  has  been  solved  completely  in  the  special  case  that  f(x)  =  Ax. 
As  we  have  seen  in  Sections  4.2  and  4.7,  all  solutions  x(t)  of  x  =  Ax  must 
exhibit  one  of  the  following  four  types  of  behavior:  (i)  x(t)  is  constant  in 
time;  (ii)  x(/)  is  a  periodic  function  of  time;  (iii)  x(/)  is  unbounded  as  t  ap¬ 
proaches  infinity;  and  (iv)  x(t)  approaches  an  equilibrium  point  as  t  ap¬ 
proaches  infinity. 

A  partial  solution  to  this  problem,  in  the  case  of  nonlinear  f(x),  was 
given  in  Section  4.3.  In  that  section  we  provided  sufficient  conditions  that 
every  solution  x(t)  of  (1),  whose  initial  value  x(0)  is  sufficiently  close  to  an 
equilibrium  point  £,  must  ultimately  approach  £  as  /  approaches  infinity.  In 
many  applications  it  is  often  possible  to  go  much  further  and  prove  that 
every  physically  (biologically)  realistic  solution  approaches  a  single 
equilibrium  point  as  time  evolves.  In  this  context,  the  following  two 
lemmas  play  an  extremely  important  role. 

Lemma  1.  Let  g(t)  be  a  monotonic  increasing  ( decreasing )  function  of  time 
for  t  >  tQ ,  with  g(t)  <  c(>  c)  for  some  constant  c .  Then ,  g(t)  has  a  limit  as 
t  approaches  infinity. 

Proof.  Suppose  that  g{t)  is  monotonic  increasing  for  t  >  /0,  and  g{t)  is 
bounded  from  above.  Let  /  be  the  least  upper  bound  of  g ;  that  is,  /  is  the 
smallest  number  which  is  not  exceeded  by  the  values  of  g(t),  for  t  >  t0.  This 
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number  must  be  the  limit  of  g(t)  as  t  approaches  infinity.  To  prove  this,  let 
e  >  0  be  given,  and  observe  that  there  exists  a  time  such  that  l— g(te) 
<  e.  (If  no  such  time  te  exists,  then  /  is  not  the  least  upper  bound  of  g.) 
Since  g(t)  is  monotonic,  we  see  that  l  —  g(t)<e  for  t>  te.  This  shows  that 
/=lim/_oog(0.  □ 


Lemma  2.  Suppose  that  a  solution  x(/)  of  (1)  approaches  a  vector  £  as  t  ap¬ 
proaches  infinity .  Then,  £  is  an  equilibrium  point  of  (1). 

Proof.  Suppose  that  x(0  approaches  £  as  /  approaches  infinity.  Then, 
Xj(t)  approaches  £,  where  £  is  the  yth  component  of  £  This  implies  that 
|jcf-(/1)  —  Xj(tf)\  approaches  zero  as  both  /,  and  t2  approach  infinity,  since 

M'l)  -  xj(h)\  =  l(*/('l)  )  +  ($“  *y(*2))l 

In  particular,  let  t{  =  t  and  t2sstl  +  h,  for  some  fixed  positive  number  h. 
Then,  \xj(t  + h)- Xj(t)\  approaches  zero  as  t  approaches  infinity.  But 

dxj(r) 

Xj(t  +  h)  —  Xj(t)  =  h  — =  hfj  (x,  (t),  . . .  ,x„(r)), 

where  r  is  some  number  between  t  and  t+h.  Finally,  observe  that 
fj(xi(r)y-’xn(T))  must  approach  j^(£,...,^)  as  t  approaches  infinity. 
Hence, =  0,y  =  1,2,...,«,  and  this  proves  Lemma  1.  □ 


Example  I.  Consider  the  system  of  differential  equations 


dx 

dt 


=  ax  —  bxy  —  ex2. 


dt 


-cy  +  dxy  -fy2 


where  a,  b,  c,  d,  e,  and  /  are  positive  constants.  This  system  (see  Section 
4.10)  describes  the  population  growth  of  two  species  x  and  y,  where  species 
y  is  dependent  upon  species  x  for  its  survival.  Suppose  that  c/d>a/e . 
Prove  that  every  solution  x(t),  y(t)  of  (2),  with  x(0)  and>>(0)>0,  ap¬ 
proaches  the  equilibrium  solution  x  =  a/e,y  =  0,  as  t  approaches  infinity. 
Solution.  Our  first  step  is  to  show  that  every  solution  x(t),y(t)  of  (2)  which 
starts  in  the  first  quadrant  (jc>0,  y  >0)  at  /  =  0  must  remain  in  the  first 
quadrant  for  all  future  time.  (If  this  were  not  so,  then  the  model  (2)  could 
not  correspond  to  reality.)  To  this  end,  recall  from  Section  1.5  that 


*(') - 77 - ’  *(0- 0 

is  a  solution  of  (2)  for  any  choice  of  x0.  The  orbit  of  this  solution  is  the 
point  (0,0)  for  xo  =  0;  the  line  0 <x<a/e  for  0 <x0<a/e;  the  point 
(a/ e, 0)  for  x0  =  a/e;  and  the  line  a/e<x<  oo  for  x0>a/e.  Thus,  the  x- 
axis,  for  x  >  0,  is  the  union  of  four  disjoint  orbits  of  (2).  Similarly,  (see  Ex¬ 
ercise  14),  the  positive  y-axis  is  a  single  orbit  of  (2).  Thus,  if  a  solution 
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Figure  1 


x(t),y(t)  of  (2)  leaves  the  first  quadrant,  its  orbit  must  cross  another  orbit, 
and  this  is  precluded  by  the  uniqueness  of  orbits  (Property  1,  Section  4.6). 

Our  next  step  is  to  divide  the  first  quadrant  into  regions  where  dx  /  dt 
and  dy /dt  have  fixed  signs.  This  is  accomplished  by  drawing  the  lines 
lx:a  —  by  —  ex  =  0,  and  /2:  —  c  +  dx—  fy=0,  in  the  x-y  plane.  These  lines 
divide  the  first  quadrant  into  three  regions  I,  II,  and  III  as  shown  in  Figure 
1.  (The  lines  lx  and  /2  do  not  intersect  in  the  first  quadrant  if  c/d>a/e .) 
Now,  observe  that  ex  +  by  is  less  than  a  in  region  I,  while  ex  +  by  is  greater 
than  a  in  regions  II  and  III.  Consequently,  dx/ dt  is  positive  in  region  I 
and  negative  in  regions  II  and  III.  Similarly,  dy  /  dt  is  negative  in  regions  I 
and  II  and  positive  in  region  III. 

Next,  we  prove  the  following  four  simple  lemmas. 


Lemma  3.  Any  solution  x{t),y{t)  of  (2)  which  starts  in  region  I  at  time  t  =  t0 
will  remain  in  this  region  for  all  future  time  t>  t0  and  ultimately  approach 
the  equilibrium  solution  x  =  a/e,y  =  0. 


Proof.  Suppose  that  a  solution  x(t),y(t)  of  (2)  leaves  region  I  at  time  t  = 
t*.  Then,  x(t*)= 0,  since  the  only  way  a  solution  can  leave  region  I  is  by 
crossing  the  line  lv  Differentiating  both  sides  of  the  first  equation  of  (2) 
with  respect  to  /  and  setting  t  =  t*  gives 


d2x(t*) 

dt2 


bx(t*) 


dt 


This  quantity  is  positive.  Hence,  x{t)  has  a  minimum  at  t  =  t*.  But  this  is 
impossible,  since  x{t)  is  always  increasing  whenever  x(t\  y{t)  is  in  region 
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I.  Thus,  any  solution  x(t ),  y(t)  of  (2)  which  starts  in  region  I  at  time  t  =  t0 
will  remain  in  region  I  for  all  future  time  t  >  t0.  This  implies  that  x(t)  is  a 
monotonic  increasing  function  of  time,  and  y(t)  is  a  monotonic  decreasing 
function  of  time  for  t  >  t0 ,  with  x(t)<a/e  andy(7)>0.  Consequently,  by 
Lemma  1,  both  x(t)  and  y(t)  have  limits  £,77  respectively,  as  /  approaches 
infinity.  Lemma  2  implies  that  (£,77)  is  an  equilibrium  point  of  (2).  Now,  it 
is  easily  verified  that  the  only  equilibrium  points  of  (2)  in  the  region  x  >  0, 
y>  0  are  x  =  0,  y  =  0,  and  x  =  a/  e9y  —  0.  Clearly,  £  cannot  equal  zero  since 
x(t)  is  increasing  in  region  I.  Therefore,  £  =  a/  e  and  77  =  0.  □ 

Lemma  4.  Any  solution  x{t)9y{t)  of  (2)  which  starts  in  region  III  at  time  t  = 
t0  must  leave  this  region  at  some  later  time. 

Proof.  Suppose  that  a  solution  x(t)9  y(t)  of  (2)  remains  in  region  III  for 
all  time  t  >  t0.  Then,  x(t)  is  a  mono  tonic  decreasing  function  of  time,  and 
y  (t)  is  a  monotonic  increasing  function  of  time,  for  t  >  t0.  Moreover,  x(t)  is 
greater  than  c/d  andy(/)  is  less  than  (dx(t0)  —  c)/f.  Consequently,  both 
x{t)  andy(7)  have  limits  £,77  respectively,  as  t  approaches  infinity.  Lemma 
2  implies  that  (£,17)  is  an  equilibrium  point  of  (2).  But  (£,17)  cannot  equal 
(0,0)  or  (a/e90)  if  x(t)9y(t)  is  in  region  III  for  t>  t0.  This  contradiction 
establishes  Lemma  4.  □ 


Lemma  5.  Any  solution  x(t)9y(t)  of  ( 2)  which  starts  in  region  II  at  time  t  = 
t0  and  remains  in  region  II  for  all  future  time  t>  t0  must  approach  the 
equilibrium  solution  x  =  a/ e9  y  =  0. 

Proof.  Suppose  that  a  solution  x(t)9y(i)  of  (2)  remains  in  region  II  for  all 
time  t  >  t0.  Then,  both  x(t)  and  y(t)  are  monotonic  decreasing  functions  of 
time  for  t  >  /0,  with  x(0>0  and  y(0>0.  Consequently,  by  Lemma  1,  both 
x{t)  mdy(t)  have  limits  £,77  respectively,  as  t  approaches  infinity.  Lemma 
2  implies  that  (£,77)  is  an  equilibrium  point  of  (2).  Now,  (£,77)  cannot  equal 
(0, 0).  Therefore,  ^  =  a/e,r]  =  0.  □ 


Lemma  6.  A  solution  x(t),y(t)  of  (2)  cannot  enter  region  III  from  region  II. 


Proof.  Suppose  that  a  solution  x(t),y(t)  of  (2)  leaves  region  II  at  time  t  = 
t*  and  enters  region  III.  Then,  y(/*)  =  0.  Differentiating  both  sides  of  the 
second  equation  of  (2)  with  respect  to  t  and  setting  t  =  t*  gives 


d2y(t*)  dx(t *) 

L=dy{t*) 


dV 


dt 


This  quantity  is  negative.  Hence,  y(t)  has  a  maximum  at  t  =  t*.  But  this  is 
impossible,  since y(t)  is  decreasing  whenever  x(t)9y(t)  is  in  region  II.  □ 


Finally,  observe  that  a  solution  x(t)9  y(t)  of  (2)  which  starts  on  lx  must 
immediately  enter  region  I,  and  that  a  solution  which  starts  on  l2  must  im- 
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mediately  enter  region  II.  It  now  follows  immediately  from  Lemmas  3-6 
that  every  solution  x(t),y(t)  of  (2),  with  x(0)>0  and  >>(0)>0,  approaches 
the  equilibrium  solution  x  —  a/e,  y  =  0  as  t  approaches  infinity. 

Up  to  now,  the  solutions  and  orbits  of  the  nonlinear  equations  that  we 
have  studied  behaved  very  much  like  the  solutions  and  orbits  of  linear 
equations.  In  actual  fact,  though,  the  situation  is  very  different.  The  solu¬ 
tions  and  orbits  of  nonlinear  equations,  in  general,  exhibit  a  completely 
different  behavior  than  the  solutions  and  orbits  of  linear  equations.  A 
standard  example  is  the  system  of  equations 

^  =  -y  +  x(\-x2-y 2),  Yt=x+y(x~x2~y2)-  (3) 

Since  the  term  x2+y2  appears  prominently  in  both  equations,  it  suggests 
itself  to  introduce  polar  coordinates  r,0 ,  where  x  =  rcos0,y  =  rsinO,  and  to 
rewrite  (3)  in  terms  of  r  and  0,  To  this  end,  we  compute 


=  2(x2+  y2)  —  2(x2+y2)2  =  2r2(\  —  r2). 


Similarly,  we  compute 


d0 

dt 


d  ,  y 

—  —arc  tan—  = 
dt  x 


X2  \+{y/xf 


x2-hy 2 
x2+y2 


Consequently,  the  system  of  equations  (3)  is  equivalent  to  the  system  of 
equations 


dr 

dt 


=  r(l-r2), 


d0 

dt 


The  general  solution  of  (4)  is  easily  seen  to  be 


r(t)  = 


0 


rl+(\-rl)e  2r 


0  =  t  +  0t 


o 


where  r0=  r(0)  and  0O=  0 (0).  Hence, 

*(0: 


0 


/o  +  (1~'o)<?  2‘ 


y(0= 


0 


/o  +  (l-'o)e  2‘] 


1/2 


cos(/  +  0o), 
sin(/  +  0o). 


Now,  observe  first  that  x  =  0,  y  =  0  is  the  only  equilibrium  solution  of  (3). 
Second,  observe  that 

x(t)  =  cos(/  +  0O  ),  y(t)  =  sin(/  +  0O  ) 
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Figure  2.  The  phase  portrait  of  (3) 


when  r0=  1.  This  solution  is  periodic  with  period  277,  and  its  orbit  is  the 
unit  circle  x2+y2  =  1.  Finally,  observe  from  (5)  that  r(t)  approaches  one  as 
t  approaches  infinity,  for  r0^0.  Hence,  all  the  orbits  of  (3),  with  the  excep¬ 
tion  of  the  equilibrium  point  x  =  0,  y  =  0,  spiral  into  the  unit  circle.  This 
situation  is  depicted  in  Figure  2. 

The  system  of  equations  (3)  shows  that  the  orbits  of  a  nonlinear  system 
of  equations  may  spiral  into  a  simple  closed  curve.  This,  of  course,  is  not 
possible  for  linear  systems.  Moreover,  it  is  often  possible  to  prove  that 
orbits  of  a  nonlinear  system  spiral  into  a  closed  curve  even  when  we 
cannot  explicitly  solve  the  system  of  equations,  or  even  find  its  orbits.  This 
is  the  content  of  the  following  celebrated  theorem. 

Theorem  5.  (Poincare-Bendixson.)  Suppose  that  a  solution  x  =  x(t),  y  = 
y(t)  of  the  system  of  differential  equations 

%-g(*>y)  (6) 

remains  in  a  bounded  region  of  the  plane  which  contains  no  equilibrium 
points  of  (6).  Then ,  its  orbit  must  spiral  into  a  simple  closed  curve ,  which  is 
itself  the  orbit  of  a  periodic  solution  of  (6). 

Example  2.  Prove  that  the  second-order  differential  equation 

z  +  (z2  +  2z2  —  l)z  +  z  =  0  (7) 

has  a  nontrivial  periodic  solution. 


433 


4  Qualitative  theory  of  differential  equations 


Solution.  First,  we  convert  Equation  (7)  to  a  system  of  two  first-order 
equations  by  setting  x  —  z  and>>  =  i.  Then, 

%-y>  ^  =  -x  +  (l-x2-2 y2)y.  (8) 


Next,  we  try  and  find  a  bounded  region  R  in  the  x  —y  plane,  containing  no 
equilibrium  points  of  (8),  and  having  the  property  that  every  solution  x(t\ 
y(t)  of  (8)  which  starts  in  R  at  time  t  =  t0,  remains  there  for  all  future  time 
t  >  t0.  It  can  be  shown  that  a  simply  connected  region  such  as  a  square  or 
disc  will  never  work.  Therefore,  we  try  and  take  R  to  be  an  annulus 
surrounding  the  origin.  To  this  end,  compute 


d_ 

dt 


+A-(i-S-2S)r\ 


and  observe  that  \-x2-2 y2  is  positive  for  x2+y2<j  and  negative  for 
x2+y2  >1.  Hence,  x2(t)+y2(t)  is  increasing  along  any  solution  x(t),  y(t) 
of  (8)  when  x2+y2<  \  and  decreasing  when  x2+y2>  1.  This  implies  that 

any  solution  x(t),  y(t)  of  (8)  which  starts  in  the  annulus  \  <x2+y2<  1  at 
time  t  =  t0  will  remain  in  this  annulus  for  all  future  time  t  >  /0.  Now,  this 
annulus  contains  no  equilibrium  points  of  (8).  Consequently,  by  the 
Poincare-Bendixson  Theorem,  there  exists  at  least  one  periodic  solution 
x(t)>y(t)  of  (8)  lying  entirely  in  this  annulus,  and  then  z  =  x(t)  is  a  nontri¬ 
vial  periodic  solution  of  (7). 


Exercises 


1.  What  Really  Happened  at  the  Paris  Peace  Talks 

The  original  plan  developed  by  Henry  Kissinger  and  Le  Due  Tho  to  settle  the 
Vietnamese  war  is  described  below.  It  was  agreed  that  1  million  South  Viet¬ 
namese  ants  and  1  million  North  Vietnamese  ants  would  be  placed  in  the  back¬ 
yard  of  the  Presidential  palace  in  Paris  and  be  allowed  to  fight  it  out  for  a  long 
period  of  time.  If  the  South  Vietnamese  ants  destroyed  nearly  all  the  North 
Vietnamese  ants,  then  South  Vietnam  would  retain  control  of  all  of  its  land.  If 
the  North  Vietnamese  ants  were  victorious,  then  North  Vietnam  would  take 
over  all  of  South  Vietnam.  If  they  appeared  to  be  fighting  to  a  standoff,  then 
South  Vietnam  would  be  partitioned  according  to  the  proportion  of  ants  re¬ 
maining.  Now,  the  South  Vietnamese  ants,  denoted  by  S ,  and  the  North  Viet¬ 
namese  ants,  denoted  by  V,  compete  against  each  other  according  to  the 
following  differential  equations: 


dS 

dt 

dN 

dt 


10  S  20S  XN 


1 


100 


N- 


1 


100 


N2- 


100 


S  XN. 


Note  that  these  equations  correspond  to  reality  since  the  South  Vietnamese 
ants  multiply  much  more  rapidly  than  the  North  Vietnamese  ants,  but  the 
North  Vietnamese  ants  are  much  better  fighters. 
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The  battle  began  at  10:00  sharp  on  the  morning  of  May  19,  1972,  and  was 
supervised  by  a  representative  of  Poland  and  a  representative  of  Canada.  At 
2:43  p.m.  on  the  afternoon  of  May  21,  the  representative  of  Poland,  being  un¬ 
happy  with  the  progress  of  the  battle,  slipped  a  bag  of  North  Vietnamese  ants 
into  the  backyard,  but  he  was  spotted  by  the  eagle  eyes  of  the  representative  of 
Canada.  The  South  Vietnamese  immediately  claimed  a  foul  and  called  off  the 
agreement,  thus  setting  the  stage  for  the  protracted  talks  that  followed  in  Paris. 
The  representative  of  Poland  was  hauled  before  a  judge  in  Paris  for  sentencing. 
The  judge,  after  making  some  remarks  about  the  stupidity  of  the  South  Viet¬ 
namese,  gave  the  Polish  representative  a  very  light  sentence.  Justify  mathemati¬ 
cally  the  judge’s  decision.  Hint : 

(a)  Show  that  the  lines  V= 2  and  N+  S=  1  divide  the  first  quadrant  into  three 
regions  (see  Figure  3)  in  which  dS /  dt  and  dN /  dt  have  fixed  signs. 

(b)  Show  that  every  solution  S  (t),N(t)  of  (*)  which  starts  in  either  region  I  or 
region  III  must  eventually  enter  region  II. 

(c)  Show  that  every  solution  S(t),N(t )  of  (*)  which  starts  in  region  II  must  re¬ 
main  there  for  all  future  time. 

(d)  Conclude  from  (c)  that  S(t)-*oo  for  all  solutions  S  of  (*)  with 

S(t0)  and  N(t0)  positive.  Conclude  too  that  N(t)  has  a  finite  limit  ( <  2)  as 

»oo. 

(e)  To  prove  that  V(/)— >0,  observe  that  there  exists  t0  such  that  dN /  dt  <  —N 
for  t  >  t0.  Conclude  from  this  inequality  that  V(/)— >0  as  /— >oo. 


Figure  3 


2.  Consider  the  system  of  differential  equations 

~=ax-  bxy,  ^  =  cy  -  dxy  -  ey2  (*) 

with  a/b>c/e.  Prove  thaty(/)->0  as  /— > oo,  for  every  solution  x(t\y(t)  of  (*) 
with  Jt(/0)  and  y(t^)  positive.  Hint :  Follow  the  outline  in  Exercise  1. 


3.  (a)  Without  computing  the  eigenvalues  of  the  matrix 


( 
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approaches  zero  as  t  approaches  infinity.  Hint:  (a)  Show  that  the  lines  x2  = 
3xx  and  xl=3x2  divide  the  xx  -  x2  plane  into  four  regions  (see  Figure  4)  in 
which  xx  and  x2  have  fixed  signs. 

(b)  Show  that  every  solution  x(/)  which  starts  in  either  region  I  or  II  must  re¬ 
main  there  for  all  future  time  and  ultimately  approach  the  equilibrium  solu¬ 
tion  x=0. 

(c)  Show  that  every  solution  x(/)  which  remains  exclusively  in  region  III  or  IV 
must  ultimately  approach  the  equilibrium  solution  x=0. 


A  closed  curve  C  is  said  to  be  a  limit  cycle  of 

x=f(x,y),  y  =  g(x,y)  (*) 


if  orbits  of  (*)  spiral  into  it,  or  away  from  it.  It  is  stable  if  all  orbits  of  (*) 
passing  sufficiently  close  to  it  must  ultimately  spiral  into  it,  and  unstable 
otherwise.  Find  all  limit  cycles  of  each  of  the  following  systems  of  differen¬ 
tial  equations.  (Hint:  Compute  d(x2+y2)/ dt.  Observe  too,  that  C  must  be 
the  orbit  of  a  periodic  solution  of  (*)  if  it  contains  no  equilibrium  points  of 

(*)•) 


x(x2+y2- 2) 

4.  x=-y - , —  5 ,x  =  x-x-xyi 

\x2+y2 

y=y-yi-yx2 

y(x2+y2-  2) 
y  =  x - -  — 

yx2+y2 
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6.  x=y  +  x(x2+y2—l)(x2+y2  —  2)  7.  x=* xy  + xcos(x2+y2) 

y  —  —  x+y(x2+y2—  \\x2+y2  —  2)  y  =  —  x2+j'cos(;c2+.y2) 

8.  (a)  Show  that  the  system 

x=y  +  xf(r)/r ,  -  x+yf(r)/r  (r2  =  x2+y2)  (*) 

has  limit  cycles  corresponding  to  the  zeros  of  / (r).  What  is  the  direction  of 
motion  on  these  curves? 

(b)  Determine  all  limit  cycles  of  (*)  and  discuss  their  stability  if  /(/*)  = 
(r  —  3)2(r2  —  5r+ 4). 

Use  the  Poincare-Bendixson  Theorem  to  prove  the  existence  of  a  nontri¬ 
vial  periodic  solution  of  each  of  the  following  differential  equations. 

9.  z  +  (z2  +  i4  — 2)z=0  10.  z  +  [ln(z2+4i2)]i  +  z=0 

11.  (a)  According  to  Green’s  theorem  in  the  plane,  if  C  is  a  closed  curve  which  is 

sufficiently  “smooth,”  and  if  /  and  g  are  continuous  and  have  continuous 
first  partial  derivatives,  then 

f  U(x,y)dy-g(x,y)dx]  =  //[/* (x,y) +gy(x,y)] dxdy 

C  R 

where  R  is  the  region  enclosed  by  C.  Assume  that  *(/),  y(t)  is  a  periodic 
solution  of  x=f(x,y),  y  =  g(x,y),  and  let  C  be  the  orbit  of  this  solution. 
Show  that  for  this  curve,  the  line  integral  above  is  zero. 

(b)  Suppose  that  fx  +  gy  has  the  same  sign  throughout  a  simply  connected  re¬ 
gion  D  in  the  x  —y  plane.  Show  that  the  system  of  equations  x  =/ (x,>>), 
y  =  g(x,y)  can  have  no  periodic  solution  which  is  entirely  in  D. 

12.  Show  that  the  system  of  differential  equations 

x  =  x  +y2  +  x3,  y  =  —x  +y  +yx2 
has  no  nontrivial  periodic  solution. 

13.  Show  that  the  system  of  differential  equations 

x  =  x  —  xy2+y3f  y  =  3y  —yx2  +  x3 

has  no  nontrivial  periodic  solution  which  lies  inside  the  circle  x2+y2  =  4. 

14.  (a)  Show  that  x=0,y  =  \p(t)  is  a  solution  of  (2)  for  any  function  \p(t)  satisfying 

xf=  -c\p-fip2. 

(b)  Choose  \ p(t0)  >  0.  Show  that  the  orbit  of  x  =  0,  y  =  \p(t)  (for  all  t  for  which  \p 
exists)  is  the  positive  y  axis. 


4.9  Introduction  to  bifurcation  theory 

Consider  the  system  of  equations 

x  =  f(x,e) 

where 


x  = 
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and  e  is  a  scalar.  Intuitively  speaking  a  bifurcation  point  of  (1)  is  a  value  of  e 
at  which  the  solutions  of  (1)  change  their  behavior.  More  precisely,  we  say 
that  e  =  e0  is  a  bifurcation  point  of  (1)  if  the  phase  portraits  of  (1)  for  e<  e0 
and  e  >  e0  are  different. 

Remark.  In  the  examples  that  follow  we  will  appeal  to  our  intuition  in 
deciding  whether  two  phase  portraits  are  the  same  or  are  different.  In  more 
advanced  courses  we  define  two  phase  portraits  to  be  the  same,  or  topologi¬ 
cally  equivalent,  if  there  exists  a  continuous  transformation  of  the  plane 
onto  itself  which  maps  one  phase  portrait  onto  the  other. 

Example  1.  Find  the  bifurcation  points  of  the  system 

*=Ax=(!  -i)x 

Solution.  The  characteristic  polynomial  of  the  matrix  A  is 

p(X)  —  det(A—  XI) 

—  (X  —  l)(X  +  l)  —  £ 

—  X2  —  (l  +  e). 

The  roots  of  p(X)  are  ±/T=i=T  for  e>  —  1,  and  ±y/—  e  —  1  i  for  e<  —  1.  This 
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Figure  2.  Phase  portrait  of  (2)  for  e  =  —  1 

implies  that  x  =  0  is  a  saddle  for  e>  —  1,  and  a  center  for  e<  — 1.  We 
conclude,  therefore,  that  e  =  —  1  is  a  bifurcation  point  of  (2).  It  is  also  clear 
that  Eq.  (2)  has  no  other  bifurcation  points. 

It  is  instructive  to  see  how  the  solutions  of  (2)  change  as  e  passes  through 
the  bifurcation  value  —  1.  For  e>  —  1,  the  eigenvalues  of  A  are 

A|  =/l  +  £  ,  A  2  =  /lTT  . 

It  is  easily  verified  (see  Exercise  10)  that 

i - 

i  1  +  yl  +  e 

1 

is  an  eigenvector  of  A  with  eigenvalue  ^1  +  e ,  while 

x2=  /  1-/T+7 

\  l  / 

is  an  eigenvector  with  eigenvalue  -\/l  +  e.  Hence,  the  phase  portrait  of  (2) 
has  the  form  shown  in  Figure  1.  As  e->  —  1  from  the  left,  the  lines  lx  and  l2 
both  approach  the  line  xx  —  x2.  This  line  is  a  line  of  equilibrium  points  of 
(2)  when  e—  - 1,  while  each  line  xx  —  x2  —  c  0)  is  an  orbit  of  (2)  for 
e  =  —  1.  The  phase  portrait  of  (2)  for  e  —  —  1  is  given  in  Figure  2. 

Example  2.  Find  the  bifurcation  points  of  the  system 

*  =  Ax=(e  -l)X-  (3) 
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Solution.  The  characteristic  polynomial  of  the  matrix  A  is 


p(X)  =  det(A—  XI)  =  det|  * 
—  X(1  +  X)  H-  e 


—  X2  +  X  8 


and  the  roots  of  p(X)  are 


—  l  +  i/l  — 4e 

Ai  = - j - >X2  = 


-l-v/l-4e 

2 


Observe  that  X,  is  positive  and  X2  is  negative  for  e<0.  Hence,  x  =  0  is  a 
saddle  for  e<  0.  For  0  < e<  1  /4,  both  Xj  and  X2  are  negative.  Hence  x  =  0 
is  a  stable  node  for  0  <  e<  1  /4.  Both  X!  and  X2  are  complex,  with  negative 
real  part,  for  e>  1  /4.  Hence  x  =  0  is  a  stable  focus  for  e>  1  /4.  Note  that  the 
phase  portrait  of  (3)  changes  as  e  passes  through  0  and  1  /4.  We  conclude, 
therefore,  that  e  =  0  and  e  =  1  /4  are  bifurcation  points  of  (3). 


Example  3.  Find  the  bifurcation  points  of  the  system  of  equations 


dxx 


dt 

dx 


=  x 


dt 


2-7 


X\  “  X2  —  £. 


Solution,  (i)  We  first  find  the  equilibrium  points  of  (4).  Setting  dxx/dt  =  0 
gives  x2  =  0,  and  then  setting  dx2/dt  =  0  gives  x\  —  e  =  0,  so  that  xx  =  ±Je9 
e>0.  Hence,  (\/e,0)  and  (  —  \/e,0)  are  two  equilibrium  points  of  (4)  for 
£>0.  The  system  (4)  has  no  equilibrium  points  when  e<0.  We  conclude, 
therefore,  that  e=0  is  a  bifurcation  point  of  (4).  (ii)  We  now  analyze  the 
behavior  of  the  solutions  of  (4)  near  the  equilibrium  points  (±]/e  ,0)  to 
determine  whether  this  system  has  any  additional  bifurcation  points.  Setting 


u  =  xx 


v  =  x 


2 


gives 
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The  system  (5)  can  be  written  in  the  form 


dt\vj 


0  1 

2{i  -1 


(!) 


+ 


0 


u 


By  Theorem  4,  the  phase  portrait  of  (4)  near  the  equilibrium  solution 


x 


±i/« ) 

0 


is  determined  by  the  phase  portrait  of  the  linearized  system 


i(:M!)= 


o  1 

±2{e  —  1 


(?) 


To  find  the  eigenvalues  of  A  we  compute 


det(A-AI) 

—  A 

±2{e 

\ 

=  A2  +  A  +2][e  . 


=  det 


1 

1-A 


Hence,  the  eigenvalues  of  A  when  u  =  xl—Je  are 


*i  = 


lH”ylH"8\/e  —  1  —  y  1  8\/e" 

-  ,  X2“  o 


while  the  eigenvalues  of  A  when  u  =  xx  +\/e  are 


1  +  y  1  — 8/e  — 1  —  yl  —  8\/e 

t  ’  ^2 =  t ; 


(6) 


(7) 


Observe  from  (6)  that  >  0,  while  A2  <  0.  Thus,  the  system  (4)  behaves  like 

a  saddle  near  the  equilibrium  points  |  ^  j .  On  the  other  hand,  we  see  from 

(7)  that  both  Xx  and  X2  are  negative  for  0<e<l/64,  and  complex  for 
e>l/64.  Consequently,  the  system  (4)  near  the  equilibrium  solution 


(  0/  ^e^aves  a  stable  node  for  0  <  e  <  1  /64,  and  a  stable  focus  for 

e  >  1  /64.  It  can  be  shown  that  the  phase  portraits  of  a  stable  node  and  a 
stable  focus  are  equivalent.  Consequently,  e  =  1  /64  is  not  a  bifurcation  point 
of  (4). 
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Another  situation  which  is  included  in  the  context  of  bifurcation  theory, 
and  which  is  of  much  current  research  interest  now,  is  when  the  system  (1) 
has  a  certain  number  of  equilibrium,  or  periodic,  solutions  for  e=  e0,  and  a 
different  number  for  e=£e0.  Suppose,  for  example,  that  x(/)  is  an  equi¬ 
librium,  or  periodic,  solution  of  (1)  for  e=0,  and  xl(t),x2(t),...9xk(t)  are 
equilibria,  or  periodic  solutions  of  (1)  for  e^O  which  approach  x(t)  as 
e^O.  In  this  case  we  say  that  the  solutions  x*(7 ),  x2(7 x*(0  bifurcate 
from  x(/).  We  illustrate  this  situation  with  the  following  example. 


Example  4.  Find  all  equilibrium  solutions  of  the  system  of  equations 


dxx 

dt 


3exx  —  3ex2  —  x\  —  x 


dx 

dt 


=  ex 


xxx2 


=  xx(e-x2). 


Solution.  Let  yx\)  an  eclu^rium  solution  of  the  system  (8).  The  second 
equation  of  (8)  implies  that  xx  =  0  or  x2  =  e. 

Xj  =  0.  In  this  case,  the  first  equation  of  (8)  implies  that 


0  =  3ex2  +  x\  —  x2(3  e-b  x2) 

0 
0 

points  of  (8). 

x2  —  e.  In  this  case,  the  first  equation  of  (8)  implies  that 


and 


0 

—  3e 


are  two  equilibrium 


so  that  x2  =  0  or  x2  =  —  3e.  Thus 


3ex!  —  3e2  —  x\  —  e2  =  0 


or 


xx  —3exx  +4e2  =  0. 


The  solutions 


3e±v9e2  —  16e2 


Xi  = 


of  (9)  are  complex.  Thus, 


x>  =  (0  and  x2=(_°3£ 


are  two  equilibrium  points  of  (8),  for  e  ¥=  0,  which  bifurcate  from  the  single 
equilibrium  point  x  =  ( j  when  e  =  0. 
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Exercises 


Find  the  bifurcation  points  of  each  of  the  following  systems  of  equations. 


In  each  of  Problems  6-8,  show  that  more  than  one  equilibrium  solutions 
bifurcate  from  the  equilibrium  solution  x  =  0  when  e  =  0. 


6.  xx  —  exx  —  ex2  —  xx  +  x\  7.  xx  =  exx  —  xx  —  xxx2 

x2  —  ex2  +  xxx2  x2  =  ~2exx  +2ex2  +  xxx2  —  x2 

8.  xx  =  ex2  4-  xxx2 

x2  —  —  exx  +  ex2  +  xx  +  x2 

9.  Consider  the  system  of  equations 
xx  =  3exx  —  5ex2  —  xf  +  x2 

x2  =  2exx  —  £X2.  ^  ' 

(a)  Show  that  each  point  on  the  lines  x2  —  xx  and  x2  —  —  xx  are  equilibrium 
points  of  ( * )  for  e  =  0. 

(b)  Show  that 


are  the  only  equilibrium  points  of  ( * )  for  e  ¥=  0. 
10.  Show  that 


and  x: 


v/T+ 

l 


are  eigenvectors  of  the  matrix  ^  j  with  eigenvalues  vT+7  and  — /f+7 
respectively. 


4.10  Predator-prey  problems;  or  why  the 
percentage  of  sharks  caught  in  the 
Mediterranean  Sea  rose  dramatically 
during  World  War  I 

In  the  mid  1920’s  the  Italian  biologist  Umberto  D’ Ancona  was  studying 
the  population  variations  of  various  species  of  fish  that  interact  with  each 
other.  In  the  course  of  his  research,  he  came  across  some  data  on  per- 
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centages-of-total-catch  of  several  species  of  fish  that  were  brought  into  dif¬ 
ferent  Mediterranean  ports  in  the  years  that  spanned  World  War  I.  In  par¬ 
ticular,  the  data  gave  the  percentage-of-total-catch  of  selachians,  (sharks, 
skates,  rays,  etc.)  which  are  not  very  desirable  as  food  fish.  The  data  for 
the  port  of  Fiume,  Italy,  during  the  years  1914-1923  is  given  below. 


1914 

1915 

1916 

1917 

1918 

11.9% 

21.4% 

22.1% 

21.2% 

36.4% 

1919 

1920 

1921 

1922 

1923 

27.3% 

16.0% 

15.9% 

14.8% 

10.7% 

D’Ancona  was  puzzled  by  the  very  large  increase  in  the  percentage  of 
selachians  during  the  period  of  the  war.  Obviously,  he  reasoned,  the  in¬ 
crease  in  the  percentage  of  selachians  was  due  to  the  greatly  reduced  level 
of  fishing  during  this  period.  But  how  does  the  intensity  of  fishing  affect 
the  fish  populations?  The  answer  to  this  question  was  of  great  concern  to 
D’Ancona  in  his  research  on  the  struggle  for  existence  between  competing 
species.  It  was  also  of  concern  to  the  fishing  industry,  since  it  would  have 
obvious  implications  for  the  way  fishing  should  be  done. 

Now,  what  distinguishes  the  selachians  from  the  food  fish  is  that  the 
selachians  are  predators,  while  the  food  fish  are  their  prey;  the  selachians 
depend  on  the  food  fish  for  their  survival.  At  first,  D’Ancona  thought  that 
this  accounted  for  the  large  increase  of  selachians  during  the  war.  Since  the 
level  of  fishing  was  greatly  reduced  during  this  period,  there  were  more 
prey  available  to  the  selachians,  who  therefore  thrived  and  multiplied 
rapidly.  However,  this  explanation  does  not  hold  any  water  since  there 
were  also  more  food  fish  during  this  period.  D’Ancona’s  theory  only  shows 
that  there  are  more  selachians  when  the  level  of  fishing  is  reduced;  it  does 
not  explain  why  a  reduced  level  of  fishing  is  more  beneficial  to  the  preda¬ 
tors  than  to  their  prey. 

After  exhausting  all  possible  biological  explanations  of  this  phenome¬ 
non,  D’Ancona  turned  to  his  colleague,  the  famous  Italian  mathematician 
Vito  Volterra.  Hopefully,  Volterra  would  formulate  a  mathematical  model 
of  the  growth  of  the  selachians  and  their  prey,  the  food  fish,  and  this 
model  would  provide  the  answer  to  D’Ancona’s  question.  Volterra  began 
his  analysis  of  this  problem  by  separating  all  the  fish  into  the  prey  popula¬ 
tion  x(t)  and  the  predator  population  y  (t).  Then,  he  reasoned  that  the  food 
fish  do  not  compete  very  intensively  among  themselves  for  their  food 
supply  since  this  is  very  abundant,  and  the  fish  population  is  not  very 
dense.  Hence,  in  the  absence  of  the  selachians,  the  food  fish  would  grow 
according  to  the  Malthusian  law  of  population  growth  x  =  ax ,  for  some 
positive  constant  a .  Next,  reasoned  Volterra,  the  number  of  contacts  per 
unit  time  between  predators  and  prey  is  bxy ,  for  some  positive  constant  b. 
Hence,  x  =  ax  —  bxy.  Similarly,  Volterra  concluded  that  the  predators  have 
a  natural  rate  of  decrease  —  cy  proportional  to  their  present  number,  and 
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that  they  also  increase  at  a  rate  dxy  proportional  to  their  present  number  y 
and  their  food  supply  x.  Thus, 


dx 

dt 


=  ax  —  bxy , 


❖ 

dt 


=  —  cy  +  dxy. 


The  system  of  equations  (1)  governs  the  interaction  of  the  selachians 
and  food  fish  in  the  absence  of  fishing.  We  will  carefully  analyze  this  sys¬ 
tem  and  derive  several  interesting  properties  of  its  solutions.  Then,  we  will 
include  the  effect  of  fishing  in  our  model,  and  show  why  a  reduced  level  of 
fishing  is  more  beneficial  to  the  selachians  than  to  the  food  fish.  In  fact,  we 
will  derive  the  surprising  result  that  a  reduced  level  of  fishing  is  actually 
harmful  to  the  food  fish. 

Observe  first  that  (1)  has  two  equilibrium  solutions  x(/)=0,^(/)  =  0  and 
x(t)  =  c/d,y(t)  =  a/b.  The  first  equilibrium  solution,  of  course,  is  of  no  in¬ 
terest  to  us.  This  system  also  has  the  family  of  solutions  x  (t)  =  x0ea‘,y(t)  = 
0  and  x(0  =  0,  y(t)=y0e~ct.  Thus,  both  the  x  and y  axes  are  orbits  of  (1). 
This  implies  that  every  solution  x(t),  y(t)  of  (1)  which  starts  in  the  first 
quadrant  x  >  0,  y  >  0  at  time  t  =  t0  will  remain  there  for  all  future  time  /  > 

^o* 

The  orbits  of  (1),  for  x,y  t*=0  are  the  solution  curves  of  the  first-order 
equation 

dy  _  —cy  +  dxy  _  y{-c  +  dx) 
dx  ax  —  bxy  x(a  —  by) 


This  equation  is  separable,  since  we  can  write  it  in  the  form 

a -by  dy  _  -c  +  dx 
y  dx  x 

Consequently,  a  lny  —  by  +  c  lnx  —  dx  =  kv  for  some  constant  kv  Taking 
exponentials  of  both  sides  of  this  equation  gives 


eby  edx 

for  some  constant  K.  Thus,  the  orbits  of  (1)  are  the  family  of  curves  de¬ 
fined  by  (3),  and  these  curves  are  closed  as  we  now  show. 


Lemma  1.  Equation  (3)  defines  a  family  of  closed  curves  for  x,  y  >0. 


Proof.  Our  first  step  is  to  determine  the  behavior  of  the  functions  f(y)  = 
ya / ehy  and  g{x)  =  xc / edx  for  x  andy  positive.  To  this  end,  observe  that 
/(0)  =  0,/(oo)  =  0,  and  f(y)  is  positive  lor  y  >0.  Computing 


ya  '(a- by) 


we  see  that/(y)  has  a  single  critical  point  at  y  =  a/b.  Consequently,  f(y) 
achieves  its  maximum  value  My±(a/b)a / ea  at y  =  a/b,  and  the  graph  of 
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Figure  1.  (a)  Graph  of  f{y)—yae  by\  (b)  Graph  of  g{x)-  xce~dx 

f(y)  has  the  form  described  in  Figure  la.  Similarly,  g(x)  achieves  its  maxi¬ 
mum  value  Mx  =  (c  /  d)c  /  ec  at  x  —  c/d,  and  the  graph  of  g{x)  has  the  form 
described  in  Figure  lb. 

From  the  preceding  analysis,  we  conclude  that  Equation  (3)  has  no  solu¬ 
tion  x,y  >  0  for  K  >  Mx  My9  and  the  single  solution  x  —  c/dyy  —  a/b  for  K 
=  MxMy.  Thus,  we  need  only  consider  the  case  K=XMy,  where  X  is  a  posi¬ 
tive  number  less  than  Mx.  Observe  first  that  the  equation  xc/edx  =  X  has 
one  solution  x  =  xm<c/d,  and  one  solution  x  =  xM>  c/d.  Hence,  the 
equation 

has  no  solution^  when  x  is  less  than  xm  or  greater  than  xM .  It  has  the  sin¬ 
gle  solution  y  =  a/ b  when  x  =  xm  or  xM ,  and  it  has  two  solutions  yx(x)  and 
y2{x)  for  each  x  between  xm  and  xM.  The  smaller  solution  y  ^x)  is  always 
less  than  a/b ,  while  the  larger  solution  y2{x)  is  always  greater  than  a/b. 
As  x  approaches  either  xm  or  xM ,  bothj^x)  andj^x)  approach  a/b.  Con¬ 
sequently,  the  curves  defined  by  (3)  are  closed  for  x  and  y  positive,  and 
have  the  form  described  in  Figure  2.  Moreover,  none  of  these  closed  curves 
(with  the  exception  of  x  =  c/d,y  =  a/b)  contain  any  equilibrium  points  of 
(1).  Therefore,  all  solutions  x(t),  y(t)  of  (1),  with  x(0)  and>>(0)  positive, 
are  periodic  functions  of  time.  That  is  to  say,  each  solution  x(t ),  y{t)  of 
(1),  with  x(0)  andj>(0)  positive,  has  the  property  that  x(/+  T)  =  x(t)  and 
y  (t  +  T)  =y(t)  for  some  positive  T.  □ 

Now,  the  data  of  D’ Ancona  is  really  an  average  over  each  one  year 
period  of  the  proportion  of  predators.  Thus,  in  order  to  compare  this  data 
with  the  predictions  of  (1),  we  must  compute  the  “average  values”  of  x(t) 
and  y(t),  for  any  solution  x(t)9  y{t)  of  (1).  Remarkably,  we  can  find  these 
average  values  even  though  we  cannot  compute  x{t)  and  >>(/)  exactly.  This 
is  the  content  of  Lemma  2. 
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Figure  2.  Orbits  of  (1)  for  x,y  positive 


Lemma  2.  Let  x{t),y{t)  be  a  periodic  solution  of  ( 1),  with  period  T>  0.  De¬ 
fine  the  average  values  of  x  and  y  as 

i  rr  i  rr 

*=tI0  x^dt>  y-  T Jo  y^dt- 

Then ,  x  —  c/d  and y  —  a/b.  In  other  words ,  the  average  values  of  x(t)  and 
y(t)  are  the  equilibrium  values. 

Proof.  Dividing  both  sides  of  the  first  equation  of  (1)  by  x  gives  x/x  = 
a  — by,  so  that 

—  by{t)]dt. 

Now,  fTx(t)/ x(t)dt  =  \nx(T)-\nx(Q>),  and  this  equals  zero  since  x(T )  = 
j  o 

x(0).  Consequently, 


—  f  by(t)dt  =  —  f  adt=a, 

1  Jq  1  J o 

so  thaty =  a /b.  Similarly,  by  dividing  both  sides  of  the  second  equation  of 
(1)  by  Ty  (t)  and  integrating  from  0  to  T,  we  obtain  that  x  —  c/d.  □ 

We  are  now  ready  to  include  the  effects  of  fishing  in  our  model.  Ob¬ 
serve  that  fishing  decreases  the  population  of  food  fish  at  a  rate  ex{t),  and 
decreases  the  population  of  selachians  at  a  rate  ey{t).  The  constant  e  re¬ 
flects  the  intensity  of  fishing;  i.e.,  the  number  of  boats  at  sea  and  the  num¬ 
ber  of  nets  in  the  water.  Thus,  the  true  state  of  affairs  is  described  by  the 
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modified  system  of  differential  equations 


dx 

dt 


=  ax  —  bxy  —  ex  =  (a  —  e)x  —  bxy 


dt 


=  —  cy  +  dxy  —  ey=  —  (c  +  e)y  +  dxy . 


This  system  is  exactly  the  same  as  (1)  (for  a  —  e>  0),  with  a  replaced  by 
a  —  e,  and  c  replaced  by  c  +  e.  Hence,  the  average  values  of  x(/)  and  >>(/) 
are  now 

-  c+e  -  a—e 

x — T’  y — r 

Consequently,  a  moderate  amount  of  fishing  (e  <  a)  actually  increases  the 
number  of  food  fish,  on  the  average,  and  decreases  the  number  of 
selachians.  Conversely,  a  reduced  level  of  fishing  increases  the  number  of 
selachians,  on  the  average,  and  decreases  the  number  of  food  fish.  This  re¬ 
markable  result,  which  is  known  as  Vol terra’s  principle,  explains  the  data 
of  D’ Ancona,  and  completely  solves  our  problem. 

Volterra’s  principle  has  spectacular  applications  to  insecticide  treat¬ 
ments,  which  destroy  both  insect  predators  and  their  insect  prey.  It  implies 
that  the  application  of  insecticides  will  actually  increase  the  population  of 
those  insects  which  are  kept  in  control  by  other  predatory  insects.  A  re¬ 
markable  confirmation  comes  from  the  cottony  cushion  scale  insect 
(Icerya  purchasi),  which,  when  accidentally  introduced  from  Australia  in 
1868,  threatened  to  destroy  the  American  citrus  industry.  Thereupon,  its 
natural  Australian  predator,  a  ladybird  beetle  (Novius  Cardinalis)  was  in¬ 
troduced,  and  the  beetles  reduced  the  scale  insects  to  a  low  level.  When 
DDT  was  discovered  to  kill  scale  insects,  it  was  applied  by  the  orchardists 
in  the  hope  of  further  reducing  the  scale  insects.  However,  in  agreement 
with  Volterra’s  principle,  the  effect  was  an  increase  of  the  scale  insect! 

Oddly  enough,  many  ecologists  and  biologists  refused  to  accept 
Volterra’s  model  as  accurate.  They  pointed  to  the  fact  that  the  oscillatory 
behavior  predicted  by  Volterra’s  model  is  not  observed  in  most  pre¬ 
dator-prey  systems.  Rather,  most  predator-prey  systems  tend  to 
equilibrium  states  as  time  evolves.  Our  answer  to  these  critics  is  that  the 
system  of  differential  equations  (1)  is  not  intended  as  a  model  of  the  gen¬ 
eral  predator-prey  interaction.  This  is  because  the  food  fish  and  selachians 
do  not  compete  intensively  among  themselves  for  their  available  resources. 
A  more  general  model  of  predator-prey  interactions  is  the  system  of  dif¬ 
ferential  equations 

x  =  ax  —  bxy  —  ex 2,  y=-cy-h  dxy  —fy2.  (6) 

Here,  the  term  ex2  reflects  the  internal  competition  of  the  prey  x  for  their 
limited  external  resources,  and  the  term  fy 2  reflects  the  competition  among 
the  predators  for  the  limited  number  of  prey.  The  solutions  of  (6)  are  not, 
in  general,  periodic.  Indeed,  we  have  already  shown  in  Example  1  of  Sec- 
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tion  4.8  that  all  solutions  x(t ),  y(t)  of  (6),  with  x(0)  and  j>(0)  positive, 
ultimately  approach  the  equilibrium  solution  x  =  a/e ,  j>  =  0  if  c/d  is 
greater  than  a/ e.  In  this  situation,  the  predators  die  out,  since  their  availa¬ 
ble  food  supply  is  inadequate  for  their  needs. 

Surprisingly,  some  ecologists  and  biologists  even  refuse  to  accept  the 
more  general  model  (6)  as  accurate.  As  a  counterexample,  they  cite  the  ex¬ 
periments  of  the  mathematical  biologist  G.  F.  Gause.  In  these  experiments, 
the  population  was  composed  of  two  species  of  protozoa,  one  of  which,  Di- 
dinium  nasatum,  feeds  on  the  other,  Paramecium  caudatum.  In  all  of 
Gause’s  experiments,  the  Didinium  quickly  destroyed  the  Paramecium  and 
then  died  of  starvation.  This  situation  cannot  be  modeled  by  the  system  of 
equations  (6),  since  no  solution  of  (6)  with  x(0)y  (0)^=0  can  reach  x  =  0  or 
y  =  0  in  finite  time. 

Our  answer  to  these  critics  is  that  the  Didinium  are  a  special,  and  atypi¬ 
cal  type  of  predator.  On  the  one  hand,  they  are  ferocious  attackers  and  re¬ 
quire  a  tremendous  amount  of  food;  a  Didinium  demands  a  fresh  Para¬ 
mecium  every  three  hours.  On  the  other  hand,  the  Didinium  don’t  perish 
from  an  insufficient  supply  of  Paramecium.  They  continue  to  multiply,  but 
give  birth  to  smaller  offspring.  Thus,  the  system  of  equations  (6)  does  not 
accurately  model  the  interaction  of  Paramecium  and  Didinium.  A  better 
model,  in  this  case,  is  the  system  of  differential  equations 


dx 

dt 


—  ax  —  bVx  y, 


dy  _  |  d\Tx  y,  x=£0 
dt  [  —  cy,  x  =  0 


It  can  be  shown  (see  Exercise  6)  that  every  solution  x(t),y(t)  of  (7)  with 
x(0)  andy(O)  positive  reaches  x  =  0  in  finite  time.  This  does  not  contradict 
the  existence-uniqueness  theorem,  since  the  function 


*(«•>- x*° 

[  —cy,  x  =  0 

does  not  have  a  partial  derivative  with  respect  to  x  or  y,  at  x  =  0. 

Finally,  we  mention  that  there  are  several  predator-prey  interactions  in 
nature  which  cannot  be  modeled  by  any  system  of  ordinary  differential 
equations.  These  situations  occur  when  the  prey  are  provided  with  a  refuge 
that  is  inaccessible  to  the  predators.  In  these  situations,  it  is  impossible  to 
make  any  definitive  statements  about  the  future  number  of  predators  and 
prey,  since  we  cannot  predict  how  many  prey  will  be  stupid  enough  to 
leave  their  refuge.  In  other  words,  this  process  is  now  random ,  rather  than 
deterministic ,  and  therefore  cannot  be  modeled  by  a  system  of  ordinary  dif¬ 
ferential  equations.  This  was  verified  directly  in  a  famous  experiment  of 
Gause.  He  placed  five  Paramecium  and  three  Didinium  in  each  of  thirty 
identical  test  tubes,  and  provided  the  Paramecium  with  a  refuge  from  the 
Didinium.  Two  days  later,  he  found  the  predators  dead  in  four  tubes,  and 
a  mixed  population  containing  from  two  to  thirty-eight  Paramecium  in  the 
remaining  twenty-six  tubes. 
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Volterra,  V:  “Le9ons  sur  la  theorie  mathematique  de  la  lutte  pour  la  vie.”  Paris, 
1931. 


Exercises 


1.  Find  all  biologically  realistic  equilibrium  points  of  (6)  and  determine  their  stabil¬ 
ity. 

2.  We  showed  in  Section  4.8  that  >>(/)  ultimately  approaches  zero  for  all  solutions 
x (t\y{t)  of  (6),  if  c/ d>  a/ e.  Show  that  there  exist  solutions  x(t),y(t)  of  (6)  for 
which  y(t)  increases  at  first  to  a  maximum  value,  and  then  decreases  to  zero.  (To 
an  observer  who  sees  only  the  predators  without  noticing  the  prey,  such  a  case 
of  a  population  passing  through  a  maximum  to  total  extinction  would  be  very 
difficult  to  explain.) 

3.  In  many  instances,  it  is  the  adult  members  of  the  prey  who  are  chiefly  attacked 
by  the  predators,  while  the  young  members  are  better  protected,  either  by  their 
smaller  size,  or  by  their  living  in  a  different  station.  Let  xx  be  the  number  of 
adult  prey,  x2  the  number  of  young  prey,  and  y  the  number  of  predators.  Then, 

xx  =  —  axxx  +  a2x2  —  bxxy 

x2  =  nxi  —  (ax  +  a2)x2 
y  =  —cy  +  dxxy 


where  a2x2  represents  the  number  of  young  (per  unit  time)  growing  into  adults, 
and  n  represents  the  birth  rate  proportional  to  the  number  of  adults.  Find  all 
equilibrium  solutions  of  this  system. 


4.  There  are  several  situations  in  nature  where  species  1  preys  on  species  2  which  in 
turn  preys  on  species  3.  One  case  of  this  kind  of  population  is  the  Island  of 
Komodo  in  Malaya  which  is  inhabited  by  giant  carnivorous  reptiles,  and  by 
mammals — their  food — which  feed  on  the  rich  vegetation  of  the  island.  We 
assume  that  the  reptiles  have  no  direct  influence  on  the  vegetation,  and  that  only 
the  plants  compete  among  themselves  for  their  available  resources.  A  system  of 
differential  equations  governing  this  interaction  is 


*1  =  -aixl-bnxlx2  +  ciix{x-i 


*2  =  -tf2*2  + £21*1*2 


*3  =  *3*3  -  *4*3  -  C31*l*3 


Find  all  equilibrium  solutions  of  this  system. 

5.  Consider  a  predator-prey  system  where  the  predator  has  alternate  means  of  sup¬ 
port.  This  system  can  be  modelled  by  the  differential  equations 


*1  =  «l*l(  Pi  -  *i)  +  Yl*l*2 

*2  =  «2*2(  Pi  ~  *2)  “  Y2*l*2 


where  x{(t)  and  x2(t)  are  the  predators  and  prey  populations,  respectively,  at 
time  t. 
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(a)  Show  that  the  change  of  coordinates  fty/(0  =  */(*/«/ A)  reduces  this  system 
of  equations  to 

y\  =yi  (i  -y\)  +  <*\y\yi>  yi =yi(\  -yi)  ~  Wiyi 

where  at  =  y1/32/alj81  and  a2  =  y2^i/«2^2- 

(b)  What  are  the  stable  equilibrium  populations  when  (i)  0<a2<  1,  (ii)  a2>  1? 

(c)  It  is  observed  that  al  —  3a2{a2  isdi  measure  of  the  aggressiveness  of  the  pre¬ 
dator).  What  is  the  value  of  a2  if  the  predator’s  instinct  is  to  maximize  its 
stable  equilibrium  population? 

6.  (a)  Let  x(t)  be  a  solution  of  x  =  ax  —  AfVx  ,  with  M  >a \A:(/0)  .  Show  that 

aVx  =  M — —  a}/ x(t0) 

(b)  Conclude  from  (a)  that  x{t)  approaches  zero  in  finite  time. 

(c)  Let  x(t),y(t)  be  a  solution  of  (7),  with  by(t0)>a'\/x(t0 )  .  Show  that  x(t) 
reaches  zero  in  finite  time.  Hint:  Observe  that  >>(/)  is  increasing  for  t  >  /0- 

(d)  It  can  be  shown  that  by(t)  will  eventually  exceed  a^\f  x{t)  for  every  solu¬ 
tion  x(t)y  y{t)  of  (7)  with  jc(/0)  andy(/0)  positive.  Conclude,  therefore,  that 
all  solutions  x(t ),  y(t)  of  (7)  achieve  x=0  in  finite  time. 


4. 1 1  The  principle  of  competitive  exclusion 
in  population  biology 

It  is  often  observed,  in  nature,  that  the  struggle  for  existence  between  two 
similar  species  competing  for  the  same  limited  food  supply  and  living 
space  nearly  always  ends  in  the  complete  extinction  of  one  of  the  species. 
This  phenomenon  is  known  as  the  “principle  of  competitive  exclusion.”  It 
was  first  enunciated,  in  a  slightly  different  form,  by  Darwin  in  1859.  In  his 
paper  ‘The  origin  of  species  by  natural  selection’  he  writes:  “As  the  species 
of  the  same  genus  usually  have,  though  by  no  means  invariably,  much  sim¬ 
ilarity  in  habits  and  constitutions  and  always  in  structure,  the  struggle  will 
generally  be  more  severe  between  them,  if  they  come  into  competition  with 
each  other,  than  between  the  species  of  distinct  genera.” 

There  is  a  very  interesting  biological  explanation  of  the  principle  of 
competitive  exclusion.  The  cornerstone  of  this  theory  is  the  idea  of  a 
“niche.”  A  niche  indicates  what  place  a  given  species  occupies  in  a  com¬ 
munity;  i.e.,  what  are  its  habits,  food  and  mode  of  life.  It  has  been  ob¬ 
served  that  as  a  result  of  competition  two  similar  species  rarely  occupy  the 
same  niche.  Rather,  each  species  takes  possession  of  those  kinds  of  food 
and  modes  of  life  in  which  it  has  an  advantage  over  its  competitor.  If  the 
two  species  tend  to  occupy  the  same  niche  then  the  struggle  for  existence 
between  them  will  be  very  intense  and  result  in  the  extinction  of  the 
weaker  species. 

An  excellent  illustration  of  this  theory  is  the  colony  of  terns  inhabiting 
the  island  of  Jorilgatch  in  the  Black  Sea.  This  colony  consists  of  four  diffe¬ 
rent  species  of  terns:  sandwich- tern,  common-tern,  blackbeak-tern,  and  lit- 
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tie-tern.  These  four  species  band  together  to  chase  away  predators  from  the 
colony.  However,  there  is  a  sharp  difference  between  them  as  regards  the 
procuring  of  food.  The  sandwich-tern  flies  far  out  into  the  open  sea  to  hunt 
certain  species,  while  the  blackbeak-tern  feeds  exclusively  on  land.  On  the 
other  hand,  common-tern  and  little-tern  catch  fish  close  to  the  shore.  They 
sight  the  fish  while  flying  and  dive  into  the  water  after  them.  The  little-tern 
seizes  his  fish  in  shallow  swampy  places,  whereas  the  common-tern  hunts 
somewhat  further  from  shore.  In  this  manner,  these  four  similar  species  of 
tern  living  side  by  side  upon  a  single  small  island  differ  sharply  in  all  their 
modes  of  feeding  and  procuring  food.  Each  has  a  niche  in  which  it  has  a 
distinct  advantage  over  its  competitors. 

In  this  section  we  present  a  rigorous  mathematical  proof  of  the  law  of 
competitive  exclusion.  This  will  be  accomplished  by  deriving  a  system  of 
differential  equations  which  govern  the  interaction  between  two  similar 
species,  and  then  showing  that  every  solution  of  the  system  approaches  an 
equilibrium  state  in  which  one  of  the  species  is  extinct. 

In  constructing  a  mathematical  model  of  the  struggle  for  existence  be¬ 
tween  two  competing  species,  it  is  instructive  to  look  again  at  the  logistic 
law  of  population  growth 

—■  =aN  —  bN2.  (1) 

dt  v  ' 

This  equation  governs  the  growth  of  the  population  N(t)  of  a  single  species 
whose  members  compete  among  themselves  for  a  limited  amount  of  food 
and  living  space.  Recall  (see  Section  1.5)  that  N(t)  approaches  the  limiting 
population  K  =  a/ b,  as  /  approaches  infinity.  This  limiting  population  can 
be  thought  of  as  the  maximum  population  of  the  species  which  the  micro¬ 
cosm  can  support.  In  terms  of  K ,  the  logistic  law  (1)  can  be  rewritten  in  the 
form 

(2) 

Equation  (2)  has  the  following  interesting  interpretation.  When  the 
population  N  is  very  low,  it  grows  according  to  the  Malthusian  law  dN /dt 
=  aN.  The  term  aN  is  called  the  “biotic  potential”  of  the  species.  It  is  the 
potential  rate  of  increase  of  the  species  under  ideal  conditions,  and  it  is  re¬ 
alized  if  there  are  no  restrictions  on  food  and  living  space,  and  if  the  indi¬ 
vidual  members  of  the  species  do  not  excrete  any  toxic  waste  products.  As 
the  population  increases  though,  the  biotic  potential  is  reduced  by  the  fac¬ 
tor  (K—  N)/ K,  which  is  the  relative  number  of  still  vacant  places  in  the 
microcosm.  Ecologists  call  this  factor  the  environmental  resistance  to 
growth. 

Now,  let  Nx(t)  and  N2(t)  be  the  population  at  time  t  of  species  1  and  2 
respectively.  Further,  let  Kx  and  K2  be  the  maximum  population  of  species 
1  and  2  which  the  microcosm  can  support,  and  let  axNx  and  a2N2  be  the 
biotic  potentials  of  species  1  and  2.  Then,  Nx(t)  and  N2(t)  satisfy  the  sys- 
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tem  of  differential  equations 


dNx 

dt 


=  a{Nx 


Kx-Nx-  m2 


dNj 

dt 


—  a2N2 


K2  —  N  2  —  mx 
K2 


where  m2  is  the  total  number  of  places  of  the  first  species  which  are  taken 
up  by  members  of  the  second  species,  and  mx  is  the  total  number  of  places 
of  the  second  species  which  are  taken  up  by  members  of  the  first  species. 
At  first  glance  it  would  appear  that  m2  =  N2  and  mx  =  Nx.  However,  this  is 
not  generally  the  case,  for  it  is  highly  unlikely  that  two  species  utilize  the 
environment  in  identical  ways.  Equal  numbers  of  individuals 'of  species  1 
and  2  do  not,  on  the  average,  consume  equal  quantities  of  food,  take  up 
equal  amounts  of  living  space  and  excrete  equal  amounts  of  waste  prod¬ 
ucts  of  the  same  chemical  composition.  In  general,  we  must  set  m2  =  aN2 
and  mx  =  f$N x,  for  some  constants  a  and  /?.  The  constants  a  and  /?  indicate 
the  degree  of  influence  of  one  species  upon  the  other.  If  the  interests  of  the 
two  species  do  not  clash,  and  they  occupy  separate  niches,  then  both  a  and 
ft  are  zero.  If  the  two  species  lay  claim  to  the  same  niche  and  are  very  sim¬ 
ilar,  then  a  and  ft  are  very  close  to  one.  On  the  other  hand,  if  one  of  the 
species,  say  species  2,  utilizes  the  environment  very  unproductively;  i.e.,  it 
consumes  a  great  deal  of  food  or  excretes  very  poisonous  waste  products, 
then  one  individual  of  species  2  takes  up  the  place  of  many  individuals  of 
species  1.  In  this  case,  then,  the  coefficient  a  is  very  large. 

We  restrict  ourselves  now  to  the  case  where  the  two  species  are  nearly 
identical,  and  lay  claim  to  the  same  niche.  Then,  a  =  /3=  1,  and  Nx(t)  and 
N2(t)  satisfy  the  system  of  differential  equations 


dNx 

dt 


~  axNx 


Kx  —  Nx  —  N2 
Kx 


dN2 

dt 


=  a2N2 


K2-Nx-N2 

k2 


In  this  instance,  we  expect  the  struggle  for  existence  between  species  1  and 
2  to  be  very  intense,  and  to  result  in  the  extinction  of  one  of  the  species. 
This  is  indeed  the  case  as  we  now  show. 


Theorem  6  (Principle  of  competitive  exclusion).  Suppose  that  Kx  is  greater 
than  K2.  Then ,  every  solution  Nx(t),  N2(t)  of  (4)  approaches  the 
equilibrium  solution  NX  =  KX ,  N2  =  0  as  t  approaches  infinity.  In  other 
words ,  if  species  1  and  2  are  very  nearly  identical ,  and  the  microcosm  can 
support  more  members  of  species  1  than  of  species  2,  then  species  2  will 
ultimately  become  extinct. 


Our  first  step  in  proving  Theorem  6  is  to  show  that  Nx(t)  and  N2(t)  can 
never  become  negative.  To  this  end,  recall  from  Section  1.5  that 


Nx(t)  = 


K,NX  (0) 

Nx  (0)  +  (Kx-Nx(0))e-‘»t’ 


N2(t)= 0 
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is  a  solution  of  (4)  for  any  choice  of  A^O).  The  orbit  of  this  solution  in  the 
Nx-N2  plane  is  the  point  (0,0)  for  Nx(0)  =  0;  the  line  0<N{<Kl9  N2  =  0  for 
0<  jV,(0)< Kx;  the  point  (A^O)  for  Nx(0)=Kx;  and  the  line  A'1<Ar,<oo, 
W2  =  0  for  Nx(0)>  Kx.  Thus,  the  Nx  axis,  for  Nx  >  0,  is  the  union  of  four  dis¬ 
tinct  orbits.  Similarly,  the  N2  axis,  for  N2>  0,  is  the  union  of  four  dis¬ 
tinct  orbits  of  (4).  This  implies  that  all  solutions  #,(/),  N2(t)  of  (4)  which 
start  in  the  first  quadrant  (Nx  >  0,  N2  >  0)  of  the  Nx-N2  plane  must  remain 
there  for  all  future  time. 

Our  second  step  in  proving  Theorem  6  is  to  split  the  first  quadrant  into 
regions  in  which  both  dNx/ dt  and  dN2/dt  have  fixed  signs.  This  is  accom¬ 
plished  in  the  following  manner.  Let  /,  and  l2  be  the  lines  Kx  —  Nx  —  N2  =  0 
and  K2  —  Nx  —  #2  =  0,  respectively.  Observe  that  dNx/dt  is  negative  if 
(#1,Ar2)  lies  above  lx,  and  positive  if  (NX,N2)  lies  below  lx.  Similarly, 
dN2/dt  is  negative  if  (NX,N2)  lies  above  /2,  and  positive  if  (A^A^)  lies  be¬ 
low  /2.  Thus,  the  two  parallel  lines  lx  and  l2  split  the  first  quadrant  of  the 
Nx-N2  plane  into  three  regions  (see  Figure  1)  in  which  both  dNx/dt  and 
dN2/dt  have  fixed  signs.  Both  Nx(t)  and  N2(t)  increase  with  time  (along 
any  solution  of  (4))  in  region  I;  Nx(t)  increases,  and  N2(t)  decreases,  with 
time  in  region  II;  and  both  Nx(t)  and  N2(t)  decrease  with  time  in  region 
III. 


Lemma  1  .Any  solution  Nx(t ),  N2(t)  of  (4)  which  starts  in  region  I  at  t  =  t0 
must  leave  this  region  at  some  later  time . 

Proof.  Suppose  that  a  solution  Nx(t),  N2(t )  of  (4)  remains  in  region  I  for 
all  time  t  >  t0.  This  implies  that  both  Nx(t)  and  N2(t)  are  monotonic  in¬ 
creasing  functions  of  time  for  t  >  t0 ,  with  Nx(t)  and  N2(t)  less  than  K2. 
Consequently,  by  Lemma  1  of  Section  4.8,  both  Nx(t)  and  N2(t)  have  limits 
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£, rj  respectively,  as  t  approaches  infinity.  Lemma  2  of  Section  4.8  implies 
that  (£,tj)  is  an  equilibrium  point  of  (4).  Now,  the  only  equilibrium  points 
of  (4)  are  (0,0),  (A^O),  and  (0,  A2),  and  (£,rj)  obviously  cannot  equal  any  of 
these  three  points.  We  conclude  therefore,  that  any  solution  Nft),  N2{t)  of 
(4)  which  starts  in  region  I  must  leave  this  region  at  a  later  time.  □ 

Lemma  2.  Any  solution  N  ft ),  N2(t)  of  (4)  which  starts  in  region  II  at  time 
t=  t0  will  remain  in  this  region  for  all  future  time  t  >  t0,  and  ultimately  ap¬ 
proach  the  equilibrium  solution  NX  =  KX,  N2  =  0. 

Proof.  Suppose  that  a  solution  Nft),  N2(t)  of  (4)  leaves  region  II  at  time 
/  =  /*.  Then,  either  #,(/*)  or  N2(/*)  *s  zero,  since  the  only  way  a  solution  of 
(4)  can  leave  region  II  is  by  crossing  lx  or  /2.  Assume  that  A)(/*)  =  0.  Dif¬ 
ferentiating  both  sides  of  the  first  equation  of  (4)  with  respect  to  t  and  set¬ 
ting  t  =  t*  gives 

d2Nx(t*)  _  -axNx(t*)  dN2(t*) 
dt2  Kx  dT~  * 

This  quantity  is  positive.  Hence,  Nft)  has  a  minimum  at  t  =  t*.  But  this  is 
impossible,  since  2V'1(^)  is  increasing  whenever  a  solution  Nft),  N2(t )  of  (4) 
is  in  region  II.  Similarly,  if  A2(/*)  =  0,  then 

d2N2{t*)  _  -a2N2(t*)  dN ,  (/*) 

dt 2  K2  dt  • 

This  quantity  is  negative,  implying  that  N2{t)  has  a  maximum  at  t  =  t*.  But 
this  is  impossible,  since  N2(t)  is  decreasing  whenever  a  solution  Nft),  N2{t) 
of  (4)  is  in  region  II. 

The  previous  argument  shows  that  any  solution  Nft),  N2(t )  of  (4)  which 
starts  in  region  II  at  time  t  =  t0  will  remain  in  region  II  for  all  future  time 
t  >  t0.  This  implies  that  Nft)  is  monotonic  increasing  and  N2(t)  is  mono¬ 
tonic  decreasing  for  t>t0 ,  with  A^1(0<AT1  and  N2(t)>K2.  Consequently, 
by  Lemma  1  of  Section  4.8,  both  Nft)  and  N2(t)  have  limits  |,tj  respec¬ 
tively,  as  t  approaches  infinity.  Lemma  2  of  Section  4.8  implies  that  (£,tj)  is 
an  equilibrium  point  of  (4).  Now,  (£,rj)  obviously  cannot  equal  (0,0)  or 
(0, K2).  Consequently,  ($,rj)  =  (Klf 0),  and  this  proves  Lemma  2.  □ 

Lemma  3.  Any  solution  Nft),  N2(t)  of  (4)  which  starts  in  region  III  at  time 
t  =  t0  and  remains  there  for  all  future  time  must  approach  the  equilibrium 
solution  Nx(t)  =  Kl9  N2(t)  =  0  as  t  approaches  infinity. 

Proof.  If  a  solution  Nft),  N2(t)  of  (4)  remains  in  region  III  for  t  >  t0,  then 
both  Nft)  and  N2(t)  are  monotonic  decreasing  functions  of  time  for  t  >  t0, 
with  Nft)>0  and  A^2(/)>0.  Consequently,  by  Lemma  1  of  Section  4.8, 
both  A^j(/)  and  N2(t)  have  limits  respectively,  as  t  approaches  infinity. 
Lemma  2  of  Section  4.8  implies  that  (£,tj)  is  an  equilibrium  point  of  (4). 
Now,  (£,tj)  obviously  cannot  equal  (0,0)  or  (0,  K2).  Consequently,  (£,  tj)  = 
(*,>0).  □ 
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Proof  of  theorem  6.  Lemmas  1  and  2  above  state  that  every  solution 
Nx(t),  N2(t)  of  (4)  which  starts  in  regions  I  or  II  at  time  t  =  t0  must  ap¬ 
proach  the  equilibrium  solution  NX  =  KX,  iV2  =  0  as  t  approaches  infinity. 
Similarly,  Lemma  3  shows  that  every  solution  Nx(t\  N2(t)  of  (4)  which 
starts  in  region  III  at  time  t  =  t0  and  remains  there  for  all  future  time  must 
also  approach  the  equilibrium  solution  NX  =  KX,  N2  =  0.  Next,  observe  that 
any  solution  Nx(t),  N2(t)  of  (4)  which  starts  on  /,  or  l2  must  immediately 
afterwards  enter  region  II.  Finally,  if  a  solution  Nx(t),N2(t)  of  (4)  leaves  re¬ 
gion  III,  then  it  must  cross  the  line  lx  and  immediately  afterwards  enter  re¬ 
gion  II.  Lemma  2  then  forces  this  solution  to  approach  the  equilibrium 
solution  Nx  =  Kx,  N2  =  0.  □ 

Theorem  6  deals  with  the  case  of  identical  species;  i.e.,  a  =  f$=  1.  By  a 
similar  analysis  (see  Exercises  4-6)  we  can  predict  the  outcome  of  the 
struggle  for  existence  for  all  values  of  a  and  /?. 

Reference 

Gause,  G.  F.,  ‘The  Struggle  for  Existence,’  Dover  Publications,  New  York,  1964. 


Exercises 


1.  Rewrite  the  system  of  equations  (4)  in  the  form 


K  i 
axNx 


dNx 

dt 


=  KX-NX-N2, 


K2  dN2 
a2N2  dt 


=  K2-Nx-N2. 


Then,  subtract  these  two  equations  and  integrate  to  obtain  directly  that  N2(t) 
approaches  zero  for  all  solutions  Nx(t),  N2(t)  of  (4)  with  Nx(to)>0. 


2.  The  system  of  differential  equations 

dNx 

-3T =  Nx[-ax  +  cx(\-bxNx-b2N2 )] 

dN2  K  } 

is  a  model  of  two  species  competing  for  the  same  limited  resource.  Suppose  that 
Cj  >  ax  and  c2  >  a2.  Deduce  from  Theorem  6  that  Nx(t)  ultimately  approaches 
zero  if  axc2>a2ch  and  N2(t)  ultimately  approaches  zero  if  axc2<a2cx. 

3.  In  1926,  Volterra  presented  the  following  model  of  two  species  competing  for 
the  same  limited  food  supply: 

dNx 

-3p=[i1-X1(/lliV1  +  /.2JV2)]Ar1 
dNy 

-^=[*2-A2(/«,^i  +  X2iV2)]^2. 

Suppose  that  bx/\ i  >62/A2.  (The  coefficient  bj\  is  called  the  susceptibility  of 
species » to  food  shortages.)  Prove  that  species  2  will  ultimately  become  extinct  if 

iVi('o)>0. 


456 


4. 1 1  The  principle  of  competitive  exclusion  in  population  biology 


Problems  4-6  are  concerned  with  the  system  of  equations 


dNx 

dt 


a\N\ 


(*  i  -Nx-  olN2  ), 


dNi 

dt 


d'2^2 


{K2-N2-$NX). 


4.  (a)  Assume  that  Kx/a>K2  and  K2/ P<KX.  Show  that  N2(t)  approaches  zero  as 

/  approaches  infinity  for  every  solution  Nx(t ),  N2(t)  of  (*)  with  Nx(t0)>0. 
(b)  Assume  that  Kx/ a  <K2  and  K2/ P  >  Kx.  Show  that  Nx(t)  approaches  zero  as 
t  approaches  infinity  for  every  solution  Nx(t),  N2(t)  of  (*)  with  NxN2(t0)>0. 
Hint :  Draw  the  lines  lx :  Nx  +  aN2  =  Kx  and  l2 :  N2  +  PNX  =  K2,  and  follow  the 
proof  of  Theorem  6. 

5.  Assume  that  Kx/a>  K2  and  K2/ p  >  Kx.  Prove  that  all  solutions  Nx(t ),  N2(t)  of 
(*),  with  both  Nx(t0)  and  N2(t0)  positive,  ultimately  approach  the  equilibrium 
solution 


Nx  =  Nx  = 


Kx  -  aK2 
1  -ap  ’ 


n2=n$= 


K2  -  pKl 
1  —  ap  ' 


Hint : 

(a)  Draw  the  lines  lx\Nx  +  aN2  =  Kx  and  l2 : N2  +  fiNx  =  K2.  The  two  lines  divide 
the  first  quadrant  into  four  regions  (see  Figure  2)  in  which  both  Nx  and  N2 
have  fixed  signs. 


(b)  Show  that  all  solutions  Nx(t ),  N2(t)  of  (*)  which  start  in  either  region  II  or 
III  must  remain  in  these  regions  and  ultimately  approach  the  equilibrium 
solution  Nx  =  Nx,  N2  =  N2. 

(c)  Show  that  all  solutions  Nx(t ),  N2(t)  of  (*)  which  remain  exclusively  in  region 
I  or  region  IV  for  all  time  t  >  t0  must  ultimately  approach  the  equilibrium 
solution  Nx  =  Nx ,  N2  =  N2. 
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Figure  3 


6.  Assume  that  Kx/ a  <K2  and  K2/ P  <  Kx. 

(a)  Show  that  the  equilibrium  solution  Nx  =  0,  N2  =  0  of  (*)  is  unstable. 

(b)  Show  that  the  equilibrium  solutions  NX  =  KX ,  N2  =  0  and  Nx=  0,  N2  =  K2  of 
(*)  are  asymptotically  stable. 

(c)  Show  that  the  equilibrium  solution  Nx  =  Nx,  N2  =  N2  (see  Exercise  5)  of  (*) 
is  a  saddle  point.  (This  calculation  is  very  cumbersome.) 

(d)  It  is  not  too  difficult  to  see  that  the  phase  portrait  of  (*)  must  have  the  form 
described  in  Figure  3. 


4.12  The  Threshold  Theorem  of  epidemiology 

Consider  the  situation  where  a  small  group  of  people  having  an  infectious 
disease  is  inserted  into  a  large  population  which  is  capable  of  catching  the 
disease.  What  happens  as  time  evolves?  Will  the  disease  die  out  rapidly,  or 
will  an  epidemic  occur?  How  many  people  will  ultimately  catch  the  dis¬ 
ease?  To  answer  these  questions  we  will  derive  a  system  of  differential 
equations  which  govern  the  spread  of  an  infectious  disease  within  a  popu¬ 
lation,  and  analyze  the  behavior  of  its  solutions.  This  approach  will  also 
lead  us  to  the  famous  Threshold  Theorem  of  epidemiology  which  states 
that  an  epidemic  will  occur  only  if  the  number  of  people  who  are  suscept¬ 
ible  to  the  disease  exceeds  a  certain  threshold  value. 

We  begin  with  the  assumptions  that  the  disease  under  consideration 
confers  permanent  immunity  upon  any  individual  who  has  completely  re¬ 
covered  from  it,  and  that  it  has  a  negligibly  short  incubation  period.  This 
latter  assumption  implies  that  an  individual  who  contracts  the  disease  be¬ 
comes  infective  immediately  afterwards.  In  this  case  we  can  divide  the 
population  into  three  classes  of  individuals:  the  infective  class  (/),  the  sus¬ 
ceptible  class  (S)  and  the  removed  class  (/?).  The  infective  class  consists  of 
those  individuals  who  are  capable  of  transmitting  the  disease  to  others. 
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The  susceptible  class  consists  of  those  individuals  who  are  not  infective, 
but  who  are  capable  of  catching  the  disease  and  becoming  infective.  The 
removed  class  consists  of  those  individuals  who  have  had  the  disease  and 
are  dead,  or  have  recovered  and  are  permanently  immune,  or  are  isolated 
until  recovery  and  permanent  immunity  occur. 

The  spread  of  the  disease  is  presumed  to  be  governed  by  the  following 
rules. 

Rule  1:  The  population  remains  at  a  fixed  level  N  in  the  time  interval 
under  consideration.  This  means,  of  course,  that  we  neglect  births,  deaths 
from  causes  unrelated  to  the  disease  under  consideration,  immigration  and 
emigration. 

Rule  2:  The  rate  of  change  of  the  susceptible  population  is  proportional 
to  the  product  of  the  number  of  members  of  (S')  and  the  number  of  mem¬ 
bers  of  (/). 

Rule  3:  Individuals  are  removed  from  the  infectious  class  (/)  at  a  rate 
proportional  to  the  size  of  (/). 

Let  S  (/),/(/),  and  R(t)  denote  the  number  of  individuals  in  classes  (S'), 
(/),  and  ( R ),  respectively,  at  time  t.  It  follows  immediately  from  Rules  1-3 
that  S(t),I(t),R(t)  satisfies  the  system  of  differential  equations 


dS 

dt 


=  —  rSl 


f 


dR 

dt 


for  some  positive  constants  r  and  y.  The  proportionality  constant  r  is 
called  the  infection  rate,  and  the  proportionality  constant  y  is  called  the  re¬ 
moval  rate. 

The  first  two  equations  of  (1)  do  not  depend  on  R.  Thus,  we  need  only 
consider  the  system  of  equations 


f— <2> 

for  the  two  unknown  functions  S  (!)  and  I(t).  Once  S  (!)  and  /(/)  are 
known,  we  can  solve  for  R(t )  from  the  third  equation  of  (1).  Alternately, 
observe  that  d(S  +  I+R)/dt  =  0.  Thus, 


S(t)  + 1  (t)  +  R(t)  =  constant = N 
so  that  R(t)  =  N-S(t)-I(t). 

The  orbits  of  (2)  are  the  solution  curves  of  the  first-order  equation 


dI  rSI-yl 
dS  -rSI 

Integrating  this  differential  equation  gives 

I  (S)  =  I0+  S0—  S  +  pin 


y 

rS' 

(3) 

S 

So’ 

(4) 
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I 


where  S0  and  70  are  the  number  of  susceptibles  and  infectives  at  the  initial 
time  f  =  f0,  and  p  =  y/r.  To  analyze  the  behavior  of  the  curves  (4),  we  com¬ 
pute  I'(S)  =  —  l+p/S.  The  quantity  -  1  +  p/  S  is  negative  for  S  >p,  and 
positive  for  S<p.  Hence,  I(S)  is  an  increasing  function  of  S'  for  S<p,  and 
a  decreasing  function  of  S  for  S  >  p. 

Next,  observe  that  7(0)=  — oo  and  7(So)  =  /o>0.  Consequently,  there 
exists  a  unique  point  S^,  with  0  <  S^  <  S0,  such  that  7(Soo)  =  0,  and  7(S) 
>0  for  <  S  <  S0.  The  point  (S^O)  is  an  equilibrium  point  of  (2)  since 
both  dS/dt  and  dl /  dt  vanish  when  7  =  0.  Thus,  the  orbits  of  (2),  for  t0<  t 
<  oo,  have  the  form  described  in  Figure  1. 

Let  us  see  what  all  this  implies  about  the  spread  of  the  disease  within 
the  population.  As  t  runs  from  t0  to  oo,  the  point  (S(t),I(t))  travels  along 
the  curve  (4),  and  it  moves  along  the  curve  in  the  direction  of  decreasing  S, 
since  S(t)  decreases  monotonically  with  time.  Consequently,  if  S0  is  less 
than  p,  then  7(f)  decreases  monotonically  to  zero,  and  S(t)  decreases 
monotonically  to  S 00.  Thus,  if  a  small  group  of  infectives  70  is  inserted  into 
a  group  of  susceptibles  S0 ,  with  S0  <  p,  then  the  disease  will  die  out  rapidly. 
On  the  other  hand,  if  S0  is  greater  than  p,  then  7(f)  increases  as  S(t)  de¬ 
creases  to  p,  and  it  achieves  a  maximum  value  when  S  =  p.  It  only  starts 
decreasing  when  the  number  of  susceptibles  falls  below  the  threshold  value 
p.  From  these  results  we  may  draw  the  following  conclusions. 

Conclusion  1 :  An  epidemic  will  occur  only  if  the  number  of  susceptibles 
in  a  population  exceeds  the  threshold  value  p  =  y/r. 

Conclusion  2:  The  spread  of  the  disease  does  not  stop  for  lack  of  a  sus¬ 
ceptible  population;  it  stops  only  for  lack  of  infectives.  In  particular,  some 
individuals  will  escape  the  disease  altogether. 

Conclusion  1  corresponds  to  the  general  observation  that  epidemics 
tend  to  build  up  more  rapidly  when  the  density  of  susceptibles  is  high  due 
to  overcrowding,  and  the  removal  rate  is  low  because  of  ignorance,  inade¬ 
quate  isolation  and  inadequate  medical  care.  On  the  other  hand,  outbreaks 
tend  to  be  of  only  limited  extent  when  good  social  conditions  entail  lower 
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densities  of  susceptibles,  and  when  removal  rates  are  high  because  of  good 
public  health  vigilance  and  control. 

If  the  number  of  susceptibles  S0  is  initially  greater  than,  but  close  to,  the 
threshold  value  p,  then  we  can  estimate  the  number  of  individuals  who 
ultimately  contract  the  disease.  Specifically,  if  S0  —  p  is  small  compared  to 
p,  then  the  number  of  individuals  who  ultimately  contract  the  disease  is  ap¬ 
proximately  2(S0  —  p).  This  is  the  famous  Threshold  Theorem  of  epidemiol¬ 
ogy,  which  was  first  proven  in  1927  by  the  mathematical  biologists 
Kermack  and  McKendrick. 


Theorem  7  (Threshold  Theorem  of  epidemiology).  Let  S0  =  p  +  v  and 
assume  that  v/p  is  very  small  compared  to  one.  Assume  moreover ,  that  the 
number  of  initial  infectives  70  is  very  small.  Then ,  the  number  of  individu¬ 
als  who  ultimately  contract  the  disease  is  2v.  In  other  words ,  the  level  of 
susceptibles  is  reduced  to  a  point  as  far  below  the  threshold  as  it  originally 
was  above  it. 

Proof.  Letting  t  approach  infinity  in  (4)  gives 

0=/0+50-5oo  +  pln^. 

If  I0  is  very  small  compared  to  S0 ,  then  we  can  neglect  it,  and  write 

O^o-^  +  pln^ 


=  S0- S^  +  pln 


So-iSo-S^) 

So 


~  S0  5^  + pin 


1 


So-Soc 


>0 


Now,  if  S0  —  p  is  small  compared  to  p.  then  S0-  will  be  small  compared 
to  S0.  Consequently,  we  can  truncate  the  Taylor  series 


In 


1- 


So-Soc 


*0 


Sq  S  qo 


l/So-Soo 


+  . . . 


'0 


‘0 


after  two  terms.  Then, 


0  —  Sq  S to  p 


So-Soo 


p  /  ^0 


00 


‘0 


So 


=  (So-SM) 


P 

2Si 


(So-S.) 


461 


4  Qualitative  theory  of  differential  equations 


Solving  for  SQ-  we  see  that 


S0-S 


00 


p+V 

P 


During  the  course  of  an  epidemic  it  is  impossible  to  accurately  ascertain 
the  number  of  new  infectives  each  day  or  week,  since  the  only  infectives 
who  can  be  recognized  and  removed  from  circulation  are  those  who  seek 
medical  aid.  Public  health  statistics  thus  record  only  the  number  of  new  re¬ 
movals  each  day  or  week,  not  the  number  of  new  infectives.  Therefore,  in 
order  to  compare  the  results  predicted  by  our  model  with  data  from  actual 
epidemics,  we  must  find  the  quantity  dR/dt  as  a  function  of  time.  This  is 
accomplished  in  the  following  manner.  Observe  first  that 

^-=yI=y(N-R-S). 


Second,  observe  that 

dS  _  dS/dt  ^  —  rSI  _  -  S 
dR  dR  /  dt  yl  p 


Hence,  S (R)  =  S0e  R/p  and 

^  =  y(N-R-S0e-«/‘’ ).  (5) 


Equation  (5)  is  separable,  but  cannot  be  solved  explicitly.  However,  if  the 
epidemic  is  not  very  large,  then  R/p  is  small  and  we  can  truncate  the 
Taylor  series 


R/p 


P  2  \p  J 
after  three  terms.  With  this  approximation, 


dR 

dt 


N—R  —  S0 


l-R/p+±(R/p)2 


N—  S0+ 


The  solution  of  this  equation  is 


1  +  a  tanh 


(\ayt- 
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where 


a  = 


So_iy+2  S0(N-S0) 
9 


1/2 


<£>  =  tanh  1  —  (  —  —  1 
a  \  p 


and  the  hyperbolic  tangent  function  tanhz  is  defined  by 


tanhz  = 


ez  —  e  2 


ez  +  e 


z  * 


It  is  easily  verified  that 

4-  tanh  z  =  sech2  z  =  ^ 


dz 


( ez  +  e~zy 


Hence, 


dR 

dt 


Equation  (7)  defines  a  symmetric  bell  shaped  curve  in  the  t-dR/dt  plane 
(see  Figure  2).  This  curve  is  called  the  epidemic  curve  of  the  disease.  It 
illustrates  very  well  the  common  observation  that  in  many  actual  epidem¬ 
ics,  the  number  of  new  cases  reported  each  day  climbs  to  a  peak  value  and 
then  dies  away  again. 


_ i _ 

2  <f>/ar 


Figure  2 

Kermack  and  McKendrick  compared  the  values  predicted  for  dR/ dt 
from  (7)  with  data  from  an  actual  plague  in  Bombay  which  spanned  the 
last  half  of  1905  and  the  first  half  of  1906.  They  set 

^  =890sech2(0.2/-3.4) 

with  t  measured  in  weeks,  and  compared  these  values  with  the  number  of 
deaths  per  week  from  the  plague.  This  quantity  is  a  very  good  approxima¬ 
tion  of  dR/dt,  since  almost  all  cases  terminated  fatally.  As  can  be  seen 
from  Figure  3,  there  is  excellent  agreement  between  the  actual  values  of 
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dR/dt,  denoted  by  •,  and  the  values  predicted  by  (7).  This  indicates,  of 
course,  that  the  system  of  differential  equations  (1)  is  an  accurate  and  reli¬ 
able  model  of  the  spread  of  an  infectious  disease  within  a  population  of 
fixed  size. 
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Exercises 

1.  Derive  Equation  (6). 

2.  Suppose  that  the  members  of  (S')  are  vaccinated  against  the  disease  at  a  rate  X 
proportional  to  their  number.  Then, 

f.-rSI-XS,  %-rSI-lI.  C) 

(a)  Find  the  orbits  of  (*). 

(b)  Conclude  from  (a)  that  S(t)  approaches  zero  as  t  approaches  infinity,  for  ev¬ 
ery  solution  S(t ),  7(0  of  (*). 
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3.  Suppose  that  the  members  of  (S')  are  vaccinated  against  the  disease  at  a  rate  X 
proportional  to  the  product  of  their  numbers  and  the  square  of  the  members  of 
(/).  Then, 

^  =  —  rSI—XSI2,  % =i(rS-y ).  (*) 

(a)  Find  the  orbits  of  (*). 

(b)  Will  any  susceptibles  remain  after  the  disease  dies  out? 

4.  The  intensity  i  of  an  epidemic  is  the  proportion  of  the  total  number  of  suscept¬ 
ibles  that  finally  contracts  the  disease.  Show  that 

Jo+so~Sao 

1  So 

where  is  a  root  of  the  equation 

S  =  S0e(s-s0-/0)/p 

5.  Compute  the  intensity  of  the  epidemic  if  p=  1000,  70=  10,  and  (a)  S0=  1 100,  (b) 
S0=  1200,  (c)  S0=  1300,  (d)  S0=  1500,  (e)  S0=  1800,  (0  S0=  1900.  (This  cannot 
be  done  analytically.) 

6.  Let  R oo  denote  the  total  number  of  individuals  who  contract  the  disease. 

(a)  Show  that  RO0  =  I0+  S0—  S^. 

(b)  Let  R]  denote  the  members  of  (R)  who  are  removed  from  the  population 
prior  to  the  peak  of  the  epidemic.  Compute  Rx/ R^  for  each  of  the  values  of 
S0  in  5a-5f.  Notice  that  most  of  the  removals  occur  after  the  peak.  This  type 
of  asymmetry  is  often  found  in  actual  notifications  of  infectious  diseases. 

7.  It  was  observed  in  London  during  the  early  1900’s,  that  large  outbreaks  of 
measles  epidemics  recurred  about  once  every  two  years.  The  mathematical  biol¬ 
ogist  H.  E.  Soper  tried  to  explain  this  phenomenon  by  assuming  that  the  stock 
of  susceptibles  is  constantly  replenished  by  new  recruits  to  the  population.  Thus, 
he  assumed  that 

f  f  ~'si-y  <*) 

for  some  positive  constants  r,  y,  and  p. 

(a)  Show  that  S=  y/r,  7  =  p/y  is  the  only  equilibrium  solution  of  (*). 

(b)  Show  that  every  solution  S(t),  7(f)  of  (*)  which  starts  sufficiently  close  to 
this  equilibrium  point  must  ultimately  approach  it  as  t  approaches  infinity. 

(c)  It  can  be  shown  that  every  solution  S(t),  7(f)  of  (*)  approaches  the 
equilibrium  solution  S  =  y/r,  7=/x/y  as  t  approaches  infinity.  Conclude, 
therefore,  that  the  system  (*)  does  not  predict  recurrent  outbreaks  of  measles 
epidemics.  Rather,  it  predicts  that  the  disease  will  ultimately  approach  a 
steady  state. 

4.13  A  model  for  the  spread  of  gonorrhea 

Gonorrhea  ranks  first  today  among  reportable  communicable  diseases  in 
the  United  States.  There  are  more  reported  cases  of  gonorrhea  every  year 
than  the  combined  totals  for  syphilis,  measles,  mumps,  and  infectious 
hepatitis.  Public  health  officials  estimate  that  more  than  2,500,000  Ameri- 
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cans  contract  gonorrhea  every  year.  This  painful  and  dangerous  disease, 
which  is  caused  by  the  gonococcus  germ,  is  spread  from  person  to  person 
by  sexual  contact.  A  few  days  after  the  infection  there  is  usually  itching 
and  burning  of  the  genital  area,  particularly  while  urinating.  About  the 
same  time  a  discharge  develops  which  males  will  notice,  but  which  females 
may  not  notice.  Infected  women  may  have  no  easily  recognizable  symp¬ 
toms,  even  while  the  disease  does  substantial  internal  damage.  Gonorrhea 
can  only  be  cured  by  antibiotics  (usually  penicillin).  However,  treatment 
must  be  given  early  if  the  disease  is  to  be  stopped  from  doing  serious 
damage  to  the  body.  If  untreated,  gonorrhea  can  result  in  blindness,  steril¬ 
ity,  arthritis,  heart  failure,  and  ultimately,  death. 

In  this  section  we  construct  a  mathematical  model  of  the  spread  of 
gonorrhea.  Our  work  is  greatly  simplified  by  the  fact  that  the  incubation 
period  of  gonorrhea  is  very  short  (3-7  days)  compared  to  the  often  quite 
long  period  of  active  infectiousness.  Thus,  we  will  assume  in  our  model 
that  an  individual  becomes  infective  immediately  after  contracting  gonor¬ 
rhea.  In  addition,  gonorrhea  does  not  confer  even  partial  immunity  to 
those  individuals  who  have  recovered  from  it.  Immediately  after  recovery, 
an  individual  is  again  susceptible.  Thus,  we  can  split  the  sexually  active 
and  promiscuous  portion  of  the  population  into  two  groups,  susceptibles 
and  infectives.  Let  cx(t)  be  the  total  number  of  promiscuous  males,  c2(t) 
the  total  number  of  promiscuous  females,  x(t)  the  total  number  of  infec¬ 
tive  males,  andy(/)  the  total  number  of  infective  females,  at  time  t.  Then, 
the  total  numbers  of  susceptible  males  and  susceptible  females  are  cx{t)  — 
x(t)  and  c2(t)—y(t)  respectively.  The  spread  of  gonorrhea  is  presumed  to 
be  governed  by  the  following  rules: 

1.  Male  infectives  are  cured  at  a  rate  ax  proportional  to  their  total  num¬ 
ber,  and  female  infectives  are  cured  at  a  rate  a2  proportional  to  their  total 
number.  The  constant  ax  is  larger  than  a2  since  infective  males  quickly  de¬ 
velop  painful  symptoms  and  therefore  seek  prompt  medical  attention. 
Female  infectives,  on  the  other  hand,  are  usually  asymptomatic,  and  there¬ 
fore  are  infectious  for  much  longer  periods. 

2.  New  infectives  are  added  to  the  male  population  at  a  rate  bx  propor¬ 
tional  to  the  total  number  of  male  susceptibles  and  female  infectives.  Simi¬ 
larly,  new  infectives  are  added  to  the  female  population  at  a  rate  b2  pro¬ 
portional  to  the  total  number  of  female  susceptibles  and  male  infectives. 

3.  The  total  numbers  of  promiscuous  males  and  promiscuous  females  re¬ 
main  at  constant  levels  cx  and  c2,  respectively. 

It  follows  immediately  from  rules  1-3  that 


dx 

dt 


—  axx  +  bx(cx  —  x)y 


dt 


-a2y  +  b2(c2-y)x. 
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Remark.  The  system  of  equations  (1)  treats  only  those  cases  of  gonorrhea 
which  arise  from  heterosexual  contacts;  the  case  of  homosexual  contacts 
(assuming  no  interaction  between  heterosexuals  and  homosexuals)  is 
treated  in  Exercises  5  and  6.  The  number  of  cases  of  gonorrhea  which  arise 
from  homosexual  encounters  is  a  small  percentage  of  the  total  number  of 
incidents  of  gonorrhea.  Interestingly  enough,  this  situation  is  completely 
reversed  in  the  case  of  syphilis.  Indeed,  more  than  90%  of  all  cases  of 
syphilis  reported  in  the  state  of  Rhode  Island  during  1973  resulted  from 
homosexual  encounters.  (This  statistic  is  not  as  startling  as  it  first  appears. 
Within  ten  to  ninety  days  after  being  infected  with  syphilis,  an  individual 
usually  develops  a  chancre  sore  at  the  spot  where  the  germs  entered  the 
body.  A  homosexual  who  contracts  syphilis  as  a  result  of  anal  intercourse 
with  an  infective  will  develop  a  chancre  sore  on  his  rectum.  This  individ¬ 
ual,  naturally,  will  be  reluctant  to  seek  medical  attention,  since  he  will  then 
have  to  reveal  his  identity  as  a  homosexual.  Moreover,  he  feels  no  sense  of 
urgency,  since  the  chancre  sore  is  usually  painless  and  disappears  after 
several  days.  With  gonorrhea,  on  the  other  hand,  the  symptoms  are  so 
painful  and  unmistakable  that  a  homosexual  will  seek  prompt  medical 
attention.  Moreover,  he  need  not  reveal  his  identity  as  a  homosexual  since 
the  symptoms  of  gonorrhea  appear  in  the  genital  area.) 

Our  first  step  in  analyzing  the  system  of  differential  equations  (1)  is  to 
show  that  they  are  realistic.  Specifically,  we  must  show  that  x(t)  andy(0 
can  never  become  negative,  and  can  never  exceed  cx  and  c2,  respectively. 
This  is  the  content  of  Lemmas  1  and  2. 

Lemma  1.  If  x(t0)  and  y(t0)  are  positive ,  then  x(t)  and  y(t)  are  positive  for 
all  t  >  t0. 

Lemma  2.  If  x(t0)  is  less  than  c,  and  y(t0)  is  less  than  c2,  then  x(t)  is  less 
than  cx  and y(t)  is  less  than  c2for  all  t  >  t0. 

Proof  of  lemma  1.  Suppose  that  Lemma  1  is  false.  Let  t*>t0  be  the  first 

time  at  which  either  x  or  y  is  zero.  Assume  that  a;  is  zero  first.  Then, 

evaluating  the  first  equation  of  (1)  at  t  =  t*  gives  x(t*)  =  bxcxy(t*).  This 
quantity  is  positive.  (Note  that  >>(/*)  cannot  equal  zero  since  jc  =  0,  y  =  0  is 
an  equilibrium  solution  of  (1).)  Hence,  x(t)  is  less  than  zero  for  t  close  to, 
and  less  than  /*.  But  this  contradicts  our  assumption  that  t*  is  the  first 
time  at  which  x{t )  equals  zero.  We  run  into  the  same  contradiction  iiy(t*) 
=  0.  Thus,  both  x{t)  andy(/)  are  positive  for  t  >  t0.  □ 

Proof  of  lemma  2.  Suppose  that  Lemma  2  is  false.  Let  /*  >  t0  be  the  first 

time  at  which  either  x  =  cl9  or  y  =  c2.  Suppose  that  x(t*)  =  cx.  Evaluating 
the  first  equation  of  (1)  at  t  =  t*  gives  x(t*)=  —  axcx.  This  quantity  is  nega¬ 
tive.  Hence,  x{t)  is  greater  than  cx  for  t  close  to,  and  less  than  t*.  But  this 
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contradicts  our  assumption  that  t*  is  the  first  time  at  which  x(f)  equals  cx. 
We  run  into  the  same  contradiction  if  j>(7*)  =  c2.  Thus,  x(t)  is  less  than  cx 
and^(f)  is  less  than  c2  for  t>  t0.  □ 

Having  shown  that  the  system  of  equations  (1)  is  a  realistic  model  of 
gonorrhea,  we  now  see  what  predictions  it  makes  concerning  the  future 
course  of  this  disease.  Will  gonorrhea  continue  to  spread  rapidly  and  un¬ 
controllably  as  the  data  in  Figure  1  seems  to  suggest,  or  will  it  level  off 
eventually?  The  following  extremely  important  theorem  of  epidemiology 
provides  the  answer  to  this  question. 

Theorem  8. 

(a)  Suppose  that  axa2  is  less  than  bxb2cxc2.  Then ,  every  solution  x(t), 
y( t)  of  (1)  with  0<x(to)<cx  and  0<>>(/0)<c2,  approaches  the 
equilibrium  solution 

__  bxb2cxc2  —  axa2  bxb2cxc2  — axa2 

a\b2+  blb2c2  ’  y~  a2bi  +  b{b2c{ 

as  t  approaches  infinity.  In  other  words ,  the  total  numbers  of  infective 
males  and  infective  females  will  ultimately  level  off. 

Qo) Suppose  that  axa2  is  greater  than  bxb2cxc2.  Then  every  solution  x  (0. 
y(t)  of  (1)  with  0<x(/o)<c1  and  0  <c29  approaches  zero  as  t  ap¬ 

proaches  infinity.  In  other  words ,  gonorrhea  will  ultimately  die  out. 


Our  first  step  in  proving  part  (a)  of  Theorem  8  is  to  split  the  rectangle 
0  <  x  <  Cj,  0<>><c2  into  regions  in  which  both  dx/dt  and  dy/dt  have 
fixed  signs.  This  is  accomplished  in  the  following  manner.  Setting  dx/dt  — 
0  in  (1),  and  solving  for  y  as  a  function  of  x  gives 


y  = 


axx 

b\{c\  —  x) 


=<>i(x). 


Similarly,  setting  dy/dt  —  0  in  (1)  gives 


q2y 

b2(c2-y )  ’ 


or 


b2c2x 
a2  +  b2x 


<f>2(x). 


Observe  first  that  <f>x(x)  and  <j>2(x)  are  monotonic  increasing  functions  of  x ; 
</>j(x)  approaches  infinity  as  x  approaches  cx ,  and  <f>2(x)  approaches  c2  as  x 
approaches  infinity.  Second,  observe  that  the  curves  y  =  <j>x(x)  and y  =  <j>2(x) 
intersect  at  (0,0)  and  at  (x0,y0)  where 

_  bxb2cxc2  —  ala2  bib2cxc2- 

d\b2  + blb2c2  y°  a2bi  +  bib2ci 
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Figure  1.  Reported  cases  of  gonorrhea,  in  thousands,  for  1950-1973. 
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Figure  2 


Third,  observe  that  <j>2(x)  is  increasing  faster  than  ^(x)  at  Jt  =  0,  since 

byCy 

*2(0)= =*',((>). 
a2  °\c\ 

Hence,  <j>2(x)  lies  above  <j>i(x)  for  0<  x  <  x0,  and  <t>2(x)  lies  below  <j>x(x)  for 
x0<  x  <  Cj,  as  shown  in  Figure  2.  The  point  (x0, >>0)  is  an  equilibrium  point 
of  (1)  since  both  dx  /  dt  and  dy  /  dt  are  zero  when  x  =  x0  and  y  =y0. 

Finally,  observe  that  dx/dt  is  positive  at  any  point  ( x,y )  above  the 
curve  >>  =  <#>j(x),  and  negative  at  any  point  ( x,y )  below  this  curve.  Similarly, 
dy / dt  is  positive  at  any  point  (x,y)  below  the  curves =</>2( x),  and  negative 
at  any  point  (x,y)  above  this  curve.  Thus,  the  curves  >y  =  </>1(x)  and>>  = 
<t>2(x)  split  the  rectangle  0<x<C!,  0<^<c2  into  four  regions  in  which 
dx/dt  and  dy / dt  have  fixed  signs  (see  Figure  2). 

Next,  we  require  the  following  four  simple  lemmas. 

Lemma  3.  Any  solution  x(t),y(t)  of  ( 1)  which  starts  in  region  I  at  time  t  =  t0 
will  remain  in  this  region  for  all  future  time  t  >  t0  and  approach  the 
equilibrium  solution  x  =  x0,  y  =y0  as  t  approaches  infinity . 

Proof.  Suppose  that  a  solution  x{t\y{t)  of  (1)  leaves  region  I  at  time  t  = 
/*.  Then,  either  x(t*)  or  y(t*)  is  zero,  since  the  only  way  a  solution  of  (1) 
can  leave  region  I  is  by  crossing  the  curves =<|>1(x)  or y =<t>2(x).  Assume 
that  i(/*)  =  0.  Differentiating  both  sides  of  the  first  equation  of  (1)  with  re- 
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spect  to  t  and  setting  t  —  t*  gives 


d2x(t*) 
dt 2 


dt 


This  quantity  is  positive,  since  x(t*)  is  less  than  c„  and  dy /dt  is  positive  on 
the  curve  y  =  <l>\(x),  0<x<xo.  Hence,  x(7)  has  a  minimum  at  /  =  /*.  But 
this  is  impossible,  since  x(t )  is  increasing  whenever  the  solution  x(t),y(t)  is 
in  region  I.  Similarly,  ify(7*)  =  0,  then 


d2y(t*) 
dt 2 


b2(c2 


dx(t *) 
dt 


This  quantity  is  positive,  sincey(/*)  is  less  than  c2,  and  dx/dt  is  positive  on 
the  curve  y  =  <t>2(x),  0<x<x0.  Hence,  y(t)  has  a  minimum  at  t  =  t*.  But 
this  is  impossible,  sincey(0  is  increasing  whenever  the  solution  x(t),y(t)  is 
in  region  I. 

The  previous  argument  shows  that  any  solution  x(t),y(t)  of  (1)  which 
starts  in  region  I  at  time  t  =  t0  will  remain  in  region  I  for  all  future  time  t  > 
t0.  This  implies  that  x(t)  and  y(7)  are  monotonic  increasing  functions  of 
time  for  t>  t0 ,  with  *(/)<  xo  andy(/)<y0.  Consequently,  by  Lemma  1  of 
Section  4.8,  both  x(t)  andy(7)  have  limits  £,tj,  respectively,  as  t  approaches 
infinity.  Lemma  2  of  Section  4.8  implies  that  (|,tj)  is  an  equilibrium  point 
of  (1).  Now,  it  is  easily  seen  from  Figure  2  that  the  only  equilibrium  points 
of  (1)  are  (0,0)  and  (x;0,  y0).  But  (£,tj)  cannot  equal  (0,0)  since  both  x  (0 
and  >>(/)  are  increasing  functions  of  time.  Hence,  d,^)  =  (x0,  y0),  and  this 
proves  Lemma  3.  □ 


Lemma  4.  Any  solution  x(t),y(t)  of  ( 1)  which  starts  in  region  III  at  time  t  = 
t0  will  remain  in  this  region  for  all  future  time  and  ultimately  approach  the 
equilibrium  solution  x  =  x0,y  =y0. 

Proof.  Exactly  the  same  as  Lemma  3  (see  Exercise  1).  □ 

Lemma  5.  Any  solution  x(t),  y{t)  of  (1)  which  starts  in  region  II  at  time  t  — 
t0 ,  and  remains  in  region  II  for  all  future  time ,  must  approach  the 
equilibrium  solution  x  —  xQ,y  =y0  as  t  approaches  infinity. 

Proof.  If  a  solution  x(f),  y(t)  of  (1)  remains  in  region  II  for  t  >  t0 ,  then 
x(t)  is  monotonic  decreasing  andy(/)  is  monotonic  increasing  for  t>  t0. 
Moreover,  x(t)  is  positive  andy(/)  is  less  than  c2,  for  t>  t0.  Consequently, 
by  Lemma  1  of  Section  4.8,  both  x(t)  andy(/)  have  limits  £,tj  respectively, 
as  t  approaches  infinity.  Lemma  2  of  Section  4.8  implies  that  (£,rj)  is  an 
equilibrium  point  of  (1).  Now,  (£,tj)  cannot  equal  (0,0)  since  y(t)  is  increas¬ 
ing  for  t  >  t0.  Therefore,  (£,r ?)  =  (x0,y0),  and  this  proves  Lemma  5.  □ 
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Lemma  6.  Any  solution  x(t),y(t)  of  ( 1)  which  starts  in  region  IV  at  time  t  = 
t0  and  remains  in  region  IV  for  all  future  time ,  must  approach  the 
equilibrium  solution  x  =  x0,  y=y0  as  t  approaches  infinity . 

Proof.  Exactly  the  same  as  Lemma  5  (see  Exercise  2).  □ 

We  are  now  in  a  position  to  prove  Theorem  8. 

Proof  of  theorem  8.  (a)  Lemmas  3  and  4  state  that  every  solution  x(/), 
y(t )  of  (1)  which  starts  in  region  I  or  III  at  time  t  =  t0  must  approach  the 
equilibrium  solution  x  =  x0,  y  =  y0  as  t  approaches  infinity.  Similarly, 
Lemmas  5  and  6  state  that  every  solution  x(t),  y(t)  of  (1)  which  starts  in 
region  II  or  IV  and  which  remains  in  these  regions  for  all  future  time,  must 
also  approach  the  equilibrium  solution  x  =  x0,y  =y0.  Now,  observe  that  if 
a  solution  *(/),  y(t)  of  (1)  leaves  region  II  or  IV,  then  it  must  cross  the 
curve  or  y  =  <t>2(x),  and  immediately  afterwards  enter  region  I  or 

region  III.  Consequently,  all  solutions  x(t ),  y(t)  of  (1)  which  start  in  re¬ 
gions  II  and  IV  or  on  the  curves  y  =  <l>x(x)  and  j>  =  </>2(jc),  must  also  ap¬ 
proach  the  equilibrium  solution  x(t)  =  x0,y(t)  =y0.  □ 

(b)  Proof  #1.  If  axa2  is  greater  than  bxb2cxc2, ,  then  the  curves  y  =  <t>x(x) 
and  y  =  <l>2(x)  have  the  form  described  in  Figure  3  below.  In  region  I, 
dx/dt  is  positive  and  dy / dt  is  negative;  in  region  II,  both  dx/dt  and  dy / dt 
are  negative;  and  in  region  III,  dx/dt  is  negative  and  dy / dt  is  positive.  It 
is  a  simple  matter  to  show  (see  Exercise  3)  that  every  solution  x(t),y{t)  of 
(1)  which  starts  in  region  II  at  time  t  =  t0  must  remain  in  this  region  for  all 
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future  time,  and  approach  the  equilibrium  solution  x  =  0,  y  =  0  as  t  ap¬ 
proaches  infinity.  It  is  also  trivial  to  show  that  every  solution  x(t\  y(t)  of 
(1)  which  starts  in  region  I  or  region  III  at  time  t  =  t0  must  cross  the  curve 
y  =  <t>l(x)  or  y=<t>2(x),  and  immediately  afterwards  enter  region  II  (see  Ex¬ 
ercise  4).  Consequently,  every  solution  x(t),y(t)  of  (1),  with  0<x(/0)<c1 
and  0<j>(/0)<c2,  approaches  the  equilibrium  solution  x=Q,y  =0  as  t  ap¬ 
proaches  infinity.  □ 

Proof  #2.  We  would  now  like  to  show  how  we  can  use  the  Poincare- 
Bendixson  theorem  to  give  an  elegant  proof  of  part  (b)  of  Theorem  8.  Ob¬ 
serve  that  the  system  of  differential  equations  (1)  can  be  written  in  the 
form 


Thus,  by  Theorem  2  of  Section  4.3,  the  stability  of  the  solution  x=0,y  —  0 
of  (2)  is  determined  by  the  stability  of  the  equilibrium  solution  x=0,y  =  0 
of  the  linearized  system 


The  characteristic  polynomial  of  the  matrix  A  is 


X2  +  (ax  +  tf2)X  +  #  1^2  —  b\b2c\c2 

whose  roots  are 

r  2  -.1/2 

-(ax  +  a2)±[(ax  +  a2)  -4(axa2- bxb2cxc2)\ 


It  is  easily  verified  that  both  these  roots  are  real  and  negative.  Hence,  the 
equilibrium  solution  x=0,y  =  0  of  (2)  is  asymptotically  stable.  This  implies 
that  any  solution  x(t),  y(t)  of  (1)  which  starts  sufficiently  close  to  the 
origin  x=y  =  0  will  approach  the  origin  as  t  approaches  infinity.  Now, 
suppose  that  a  solution  x{t\y{t)  of  (1),  with  0<x(/o)<c1  and  0<j>(/0)< 
c2,  does  not  approach  the  origin  as  t  approaches  infinity.  By  the  previous 
remark,  this  solution  must  always  remain  a  minimum  distance  from  the 
origin.  Consequently,  its  orbit  for  t>  t0  lies  in  a  bounded  region  in  the  x  — 
y  plane  which  contains  no  equilibrium  points  of  (1).  By  the  Poincare- 
Bendixson  Theorem,  therefore,  its  orbit  must  spiral  into  the  orbit  of  a  peri¬ 
odic  solution  of  (1).  But  the  system  of  differential  equations  (1)  has  no 
periodic  solution  in  the  first  quadrant  x  >  0,  y  >  0.  This  follows  im¬ 
mediately  from  Exercise  11,  Section  4.8,  and  the  fact  that 


dx 


[ -axx  +  bx(cx-x)y]  +  [~a2y  +  b2(c2-y)x] 


=  —  (ax  +  a2  +  b\y  +  b2x) 


is  strictly  negative  if  both  x  and  y  are  nonnegative.  Consequently,  every 
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solution  x(t),y(t)  of  (1),  with  0<x(/0)<c1  and  0<y(t0)<c2  approaches 
the  equilibrium  solution  x = 0 ,y  =  0  as  t  approaches  infinity.  □ 


Now,  it  is  quite  difficult  to  evaluate  the  coefficients  ax,  a2 ,  bx,  ov  cx,  and 
c2.  Indeed,  it  is  impossible  to  obtain  even  a  crude  estimate  of  a2 ,  which 
should  be  interpreted  as  the  average  amount  of  time  that  a  female  remains 
infective.  (Similarly,  ax  should  be  interpreted  as  the  average  amount  of 
time  that  a  male  remains  infective.)  This  is  because  most  females  do  not 
exhibit  symptoms.  Thus,  a  female  can  be  infective  for  an  amount  of  time 
varying  from  just  one  day  to  well  over  a  year.  Nevertheless,  it  is  still  possi¬ 
ble  to  ascertain  from  public  health  data  that  axa2  is  less  than  bxb2cxc2 ,  as 
we  now  show.  Observe  that  the  condition  axa2<bxb2cxc2  is  equivalent  to 


1< 


v  i 
a2 


V 2 

a, 


The  quantity  bxcx/a2  can  be  interpreted  as  the  average  number  of  males 
that  one  female  infective  contacts  during  her  infectious  period,  if  every 
male  is  susceptible.  Similarly,  the  quantity  b2c2/ a  x  can  be  interpreted  as 
the  average  number  of  females  that  one  male  infective  contacts  during  his 
infectious  period,  if  every  female  is  susceptible.  The  quantities  bxcx/a2  and 
b 2c2 / ax  are  called  the  maximal  female  and  male  contact  rates,  respectively. 
Theorem  8  can  now  be  interpreted  in  the  following  manner. 

(a)  If  the  product  of  the  maximal  male  and  female  contact  rates  is  greater 
than  one,  then  gonorrhea  will  approach  a  nonzero  steady  state. 

(b)  If  the  product  of  the  maximal  male  and  female  contact  rates  is  less 
than  one,  then  gonorrhea  will  die  out  eventually. 

In  1973,  the  average  number  of  female  contacts  named  by  a  male  infec¬ 
tive  during  his  period  of  infectiousness  was  0.98,  while  the  average  number 
of  male  contacts  named  by  a  female  infective  during  her  period  of  infec¬ 
tiousness  was  1.15.  These  numbers  are  very  good  approximations  of  the 
maximal  male  and  female  contact  rates,  respectively,  and  their  product 
does  not  exceed  the  product  of  the  maximal  male  and  female  contact  rates. 
(The  number  of  contacts  of  a  male  or  female  infective  during  their  period 
of  infectiousness  is  slightly  less  than  the  maximal  male  or  female  contact 
rates.  However,  the  actual  number  of  contacts  is  often  greater  than  the 
number  of  contacts  named  by  an  infective.)  The  product  of  1.15  with  0.98 
is  1.0682.  Thus,  gonorrhea  will  ultimately  approach  a  nonzero  steady  state. 


Remark.  Our  model  of  gonorrhea  is  rather  crude  since  it  lumps  all  prom¬ 
iscuous  males  and  all  promiscuous  females  together,  regardless  of  age.  A 
more  accurate  model  can  be  obtained  by  separating  the  male  and  female 
populations  into  different  age  groups  and  then  computing  the  rate  of 
change  of  infectives  in  each  age  group.  This  has  been  done  recently,  but 
the  analysis  is  too  difficult  to  present  here.  We  just  mention  that  a  result 
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completely  analogous  to  Theorem  8  is  obtained:  either  gonorrhea  dies  out 
in  each  age  group,  or  it  approaches  a  constant,  positive  level  in  each  age 
group. 

Exercises 

In  Problems  1  and  2,  we  assume  that  axa2<bxb2cxc2. 

1.  (a)  Suppose  that  a  solution  x(t),y(t)  of  (1)  leaves  region  III  of  Figure  2  at  time 

t  =  t*  by  crossing  the  curve  y=<(>1(x)  or  y =<j>2(;c).  Conclude  that  either  x(t) 
or  y(t)  has  a  maximum  at  t  =  t*.  Then,  show  that  this  is  impossible.  Con¬ 
clude,  therefore,  that  any  solution  x(t),y(t)  of  (1)  which  starts  in  region  III 
at  time  t  =  t0  must  remain  in  region  III  for  all  future  time  t  >  t0. 

(b)  Conclude  from  (a)  that  any  solution  x(t),  y(t)  of  (1)  which  starts  in  region 
III  has  a  limit  £,17  as  t  approaches  infinity.  Then,  show  that  (£,77)  must  equal 

(*o>  ^o)- 

2.  Suppose  that  a  solution  x{t),y{t)  of  (1)  remains  in  region  IV  of  Figure  2  for  all 
time  t>  t0.  Prove  that  x(t)  andy(/)  have  limits  £,17  respectively,  as  t  approaches 
infinity.  Then  conclude  that  (£,tj)  must  equal  (x0,  y0). 

In  Problems  3  and  4,  we  assume  that  axa2>  bxb2cxc2. 

3.  Suppose  that  a  solution  x(t),y(t)  of  (1)  leaves  region  II  of  Figure  3  at  time  t  =  t* 
by  crossing  the  curve  y=<j>i(x)  or  y =<J>2(*).  Show  that  either  x(t)  or  y(t)  has  a 
maximum  at  t  =  t*.  Then,  show  that  this  is  impossible.  Conclude,  therefore,  that 
every  solution  x(t),y(t)  of  (1)  which  starts  in  region  II  at  time  t  =  /0  must  remain 
in  region  II  for  all  future  time  t  >  t0. 

4.  (a)  Suppose  that  a  solution  x(t),  y(t)  of  (1)  remains  in  either  region  I  or  III  of 

Figure  3  for  all  time  t>  t0.  Show  that  x(t)  and  y(t)  have  limits  £,17  respec¬ 
tively,  as  t  approaches  infinity. 

(b)  Conclude  from  Lemma  1  of  Section  4.8  that  (£,17)  =  (0,0). 

(c)  Show  that  (£,77)  cannot  equal  (0,0)  if  x(t),y{t)  remains  in  region  I  or  region 
III  for  all  time  t  >  t0. 

(d)  Show  that  any  solution  x(t),y(t)  of  (1)  which  starts  on  either y  =  </>1(x)  ory  = 
<f>2(.x)  will  immediately  afterwards  enter  region  II. 

5.  Assume  that  axa2  <  bxb2cxc2.  Prove  directly,  using  Theorem  2  of  Section  4.3,  that 
the  equilibrium  solution  x  =  x0,  y=y0  of  (1)  is  asymptotically  stable.  Warning : 
The  calculations  are  extremely  tedious. 

6.  Assume  that  the  number  of  homosexuals  remains  constant  in  time.  Call  this  con¬ 
stant  c.  Let  x(t)  denote  the  number  of  homosexuals  who  have  gonorrhea  at  time 
/.  Assume  that  homosexuals  are  cured  of  gonorrhea  at  a  rate  a1?  and  that  new 
infectives  are  added  at  a  rate  fil(c  —  x)x. 

(a)  Show  that  x  =  —  axx  +  /?, x(c  —  x). 

(b)  What  happens  to  x(t)  as  t  approaches  infinity? 

7.  Suppose  that  the  number  of  homosexuals  c(t)  grows  according  to  the  logistic 
law  c  =  c(a  —  be),  for  some  positive  constants  a  and  b.  Let  x(t)  denote  the  num¬ 
ber  of  homosexuals  who  have  gonorrhea  at  time  t,  and  assume  (see  Problem  6) 
that  x=  —  axx  +  pxx(c  —  x).  What  happens  to  x(t)  as  t  approaches  infinity? 
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and  Fourier  series 


5.1  Two  point  boundary-value  problems 

In  the  applications  which  we  will  study  in  this  chapter,  we  will  be  con¬ 
fronted  with  the  following  problem. 

Problem :  For  which  values  of  X  can  we  find  nontrivial  functions  y(x) 
which  satisfy 

=  0;  0y(O)  +  6/(O)  =  O,  cy(l)  + dy'(l)  =  0?  (1) 

dxL 

Equation  (1)  is  called  a  boundary- value  problem,  since  we  prescribe  infor¬ 
mation  about  the  solution  y(x)  and  its  derivative  y'(x)  at  two  distinct 
points,  x  =  0  and  x  =  l.  In  an  initial-value  problem,  on  the  other  hand,  we 
prescribe  the  value  of  y  and  its  derivative  at  a  single  point  x  =  x0. 

Our  intuitive  feeling,  at  this  point,  is  that  the  boundary-value  problem 
(1)  has  nontrivial  solutions  y  (x)  only  for  certain  exceptional  values  X.  To 
wit,  ^(x)  =  0  is  certainly  one  solution  of  (1),  and  the  existence-uniqueness 
theorem  for  second-order  linear  equations  would  seem  to  imply  that  a 
solution  y{x)  ofy"  +  Xy  =  0  is  determined  uniquely  once  we  prescribe  two 
additional  pieces  of  information.  Let  us  test  our  intuition  on  the  following 
simple,  but  extremely  important  example. 

Example  1.  For  which  values  of  X  does  the  boundary-value  problem 

^+Ay=0;  y(0)-0,  y(0-0  (2) 

have  nontrivial  solutions? 
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Solution. 

(i)  A  =  0.  Every  solution y (x)  of  the  differential  equation y"  =0  is  of  the 
loimy(x)  =  cxx  +  c2,  for  some  choice  of  constants  cl  and  c2.  The  condition 
>^(0)  =  0  implies  that  c2= 0,  and  the  condition  >>(/)  =  0  then  implies  that  cx  = 
0.  Thus,  ^(x)  =  0  is  the  only  solution  of  the  boundary- value  problem  (2), 
for  A  =  0. 

(ii)  A<0:  In  this  case,  every  solution  y(x)  of  y"  -I-  Ay  =0  is  of  the  form 

y{x)  =  cxe^~^ x  +  c2e~^~^ x ,  for  some  choice  of  constants  cx  and  c2.  The 
boundary  conditions  >>(())  =>>(/)  =  ()  imply  that 

Cj  +  c2  =  0,  lcx-\- e~^~^  lc2  =  0.  (3) 

The  system  of  equations  (3)  has  a  nonzero  solution  cx,c2  if,  and  only  if, 


det 


1 

e^7 


1 


=:e~V~X/-eV~X 


This  implies  that  =  or  e2^~^1  =  1.  But  this  is  impossible, 

since  ez  is  greater  than  one  for  z  >0.  Hence,  c,  =  c2  =  0  and  the  boundary- 
value  problem  (2)  has  no  nontrivial  solutions  y{x)  when  A  is  negative. 

(iii)  A  >  0:  In  this  case,  every  solution  y(x)  of.y"+Ay=0  is  of  the  form 
y(x)  =  cx  cosVX  x  +  c2sinVA  x,  for  some  choice  of  constants  cx  and  c2. 
The  condition  >>(0)  =  0  implies  that  Cj  =  0,  and  the  condition  >>(/)  =  0  then 
implies  that  c2sinVX  /=0.  This  equation  is  satisfied,  for  any  choice  of  c2, 
if  VA  1=mt ,  or  A=«27t2//2,  for  some  positive  integer  n.  Hence,  the 
boundary- value  problem  (2)  has  nontrivial  solutions  y(x)  =  c  sin  rnx/l  for 
A=rt27T2//2,  «  =  1,2, _ 


Remark.  Our  calculations  for  the  case  A<0  can  be  simplified  if  we  write 
every  solution y(x)  in  the  form  y  =  cx  cosh V  —A  x-hc2 sinhV  — A  x,  where 


cosh V —A  x  = 


-he 


V^x 


and 


„V  — A  x _ 0  —  V— A  x 

sinhV— A  x  = - - - . 

The  condition  j>(0)=0  implies  that  Cj=0,  and  the  condition  j>(/)  =  0  then 
implies  that  c2  sinhV  -  A  /=0.  But  sinhz  is  positive  for  z  >0.  Hence,  c2  =  0, 
and>>(x)  =  0. 

Example  1  is  indicative  of  the  general  boundary- value  problem  (1).  In¬ 
deed,  we  have  the  following  remarkable  theorem  which  we  state,  but  do 
not  prove. 
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Theorem  1.  The  boundary -value  problem  (1)  has  nontrivial  solutions  y(x) 
only  for  a  denumerable  set  of  values  Xx,X2,...,  where  \x<\2---,  and  A„  ap¬ 
proaches  infinity  as  n  approaches  infinity.  These  special  values  of  A  are 
called  eigenvalues  of  ( 1),  and  the  nontrivial  solutions  y(x)  are  called  eigen¬ 
functions  of  (1).  In  this  terminology ,  the  eigenvalues  of  (2)  are 
7r2//2,47r2//2,97r2//2,...,  and  the  eigenfunctions  of  (2)  are  all  constant 
multiples  of  sin77\x  /  /,  sin27rx 


There  is  a  very  natural  explanation  of  why  we  use  the  terms  eigenvalue 
and  eigenfunction  in  this  context.  Let  V  be  the  set  of  all  functions  y(x) 
which  have  two  continuous  derivatives  and  which  satisfy  ay(0)  +  by'(0)  = 
0 9cy(l)  +  dy'(l)  =  0.  Clearly,  V  is  a  vector  space,  of  infinite  dimension. 
Consider  now  the  linear  operator,  or  transformation  L,  defined  by  the 
equation  2 

[Ly](x)=-^{x).  (4) 


The  solutions >>(*)  of  (1)  are  those  functions >>  in  V  for  which  Ly=Xy.  That 
is  to  say,  the  solutions  y(x)  of  (1)  are  exactly  those  functions  y  in  V  which 
are  transformed  by  L  into  multiples  A  of  themselves. 


Example  2.  Find  the  eigenvalues  and  eigenfunctions  of  the  boundary-value 
problem 

^+V= 0;  y(0)+/(0)=0,  >»(1)=0.  (5) 

Solution. 

(i)  A  =  0.  Every  solution  y{x)  of  y"=0  is  of  the  ioxmy{x)  =  cxx  + cl9  for 
some  choice  of  constants  cx  and  c2.  The  conditions y(0)+y(0)  =  0  and>>(l) 
=  0  both  imply  that  c2=  —  cx.  Hence,  y(x)  =  c(x—  1),  c=£0,  is  a  nontrivial 
solution  of  (5)  when  A  =  0;  i.e.,y(x)  =  c(x—  1),  c^=  0,  is  an  eigenfunction  of 
(5)  with  eigenvalue  zero. 

(ii)  A<0.  In  this  case,  every  solution  y(x)  of  y"+A>>=0  is  of  the  form 
y(x)  =  cx  cosh V  — A  x  +  c2sinh\A-A  x,  for  some  choice  of  constants  cx  and 
c2.  The  boundary  conditions ^(0)+^'(0)  =  0  and^(l)  =  0  imply  that 

Cj  +  V  —  Ac2  =  0,  cosh V  —  A  cx  4-  sinh V  —  A  c2  =  0.  (6) 


(Observe  that  (cosh x)'  =  sinh x  and  (sinh x)'  =  cosh jc.)  The  system  of  equa¬ 
tions  (6)  has  a  nontrivial  solution  cx,c2  if,  and  only  if, 


det 


^ ^  ^  I  =  sinhV  —  A  -  V  -  A  cosh V  — A  =0. 

coshV  —  A  sinh  A 


This  implies  that 


sinhV  —  A  =  V  —  A  coshV  —  A  . 


(7) 


But  Equation  (7)  has  no  solution  A  <  0.  To  see  this,  let  z  =  V  — A  ,  and  con- 
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sider  the  function  h(z)  =  z  cosh  z  -  sinhz.  This  function  is  zero  for  z  =  0  and 
is  positive  for  z  >  0,  since  its  derivative 

h'(z )  =  cosh  z  4-  z  sinhz  —  coshz  =  z  sinhz 

is  strictly  positive  for  z  >  0.  Hence,  no  negative  number  X  can  satisfy  (7). 

(iii)  X>0.  In  this  case,  every  solution  y(x)  of  j>"  +  Xj>  =  0  is  of  the  form 
y(x)  =  cx  cosVX  x  +  c2  sinVX  x9  for  some  choice  of  constants  cx  and  c2. 
The  boundary  conditions  imply  that 

c1  +  VXc2  =  0,  cosVX  Cj  +  sinVX  c2  =  0.  (8) 

The  system  of  equations  (8)  has  a  nontrivial  solution  cvc2  if,  and  only  if, 

detl  *  |  =  sinVX  —  VX  cosVX  =0. 

\  cosVX  sinVX  / 


This  implies  that 


tanVX  =  VX  . 


To  find  those  values  of  X  which  satisfy  (9),  we  set  |=  VX  and  draw  the 
graphs  of  the  functions  17  =  ^  and  tj  =  tan£  in  the  £  — tj  plane  (see  Figure  1); 
the  £  coordinate  of  each  point  of  intersection  of  these  curves  is  then  a  root 
of  the  equation  £  =  tan£.  It  is  clear  that  these  curves  intersect  exactly  once 
in  the  interval  7r/2<£< 377/2,  and  this  occurs  at  a  point  ^>77.  Similarly, 
these  two  curves  intersect  exactly  once  in  the  interval  377/2<£<577/2,  and 
this  occurs  at  a  point  £2  >  27 7.  More  generally,  the  curves  77  =  £  and  17  =  tan£ 
intersect  exactly  once  in  the  interval 


(In— 1)77  (2az  +  1)77 

2  <^<  2 


and  this  occurs  at  a  point  ^n>m. 


Figure  1.  Graphs  of  17  =  £  and  17  =  tan  £ 
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Finally,  the  curves  tj  =  £  and  77  =  tan  £  do  not  intersect  in  the  interval  0< 
£<7t/2.  To  prove  this,  set  /i(£)  =  tan£-£  and  compute 


/*'(£)  =  sec2  £-  l  =  tan2£. 

This  quantity  is  positive  for  0  <  £  <  v  jl.  Consequently,  the  eigenvalues  of 
(5)  are  Xx  =  £2,  A2  =  £2 ,  •  •  • ,  and  the  eigenfunction  of  (5)  are  all  constant  mul¬ 
tiples  of  the  functions  —  \[\x  cos \[\x  x  +  sin  *,  —  V^2  cos 

4- sin V \2  x,... .  We  cannot  compute  X„  exactly.  Nevertheless,  we  know 
that 

n2*ir2<\n  <(2/i  4-  l)27r2/4. 

In  addition,  it  is  clear  that  \  approaches  (2 n  +  1)27t2/4  as  n  approaches  in¬ 
finity. 


Exercises 

Find  the  eigenvalues  and  eigenfunctions  of  each  of  the  following 
boundary-value  problems. 


1.  y"+Xy  =  0; 

2.  y"+Xy  = 0; 

3.  y"  —Xy=0; 

4.  y"  +\y  =  0; 

5.  y"+\y  =  0; 

6.  y"+Xy  = 0; 

7.  y"+Xy  =  0; 


y(0)  =  0,  /(/)  =  0 

y\ 0)=o,  /(/)= 0 
y\ 0)=o,  /(/)= 0 

/(0)  =  0,  y(/)  =  0 

y(0)  =  0,  y(7r)-y'(7T)  =  0 

y(0)-/(0)  =  0,  y(l)  =  0 
y(0)-/(0)  =  0,  y(7r)-y'(ir)  =  0 


8.  For  which  values  of  X  does  the  boundary-value  problem 

y"-2/  +  (l+X)y  =  0;  y( 0)  =  0,  y(l)  =  0 

have  a  nontrivial  solution? 

9.  For  which  values  of  X  does  the  boundary-value  problem 

y"+Xy  =  0;  y(0)=y(2‘ir),  /(0)=/(2tt) 

have  a  nontrivial  solution? 

10.  Consider  the  boundary-value  problem 

y"+Xy=/(0;  y(0)  =  0,  y(l)  =  0  (*) 

(a)  Show  that  (*)  has  a  unique  solution  y(t)  if  X  is  not  an  eigenvalue  of  the  ho¬ 
mogeneous  problem. 

(b)  Show  that  (*)  may  have  no  solution  y(t)  if  X  is  an  eigenvalue  of  the  homo¬ 
geneous  problem. 

(c)  Let  X  be  an  eigenvalue  of  the  homogeneous  problem.  Determine  conditions 
on  /  so  that  (*)  has  a  solution  y(t).  Is  this  solution  unique? 
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5.2  Introduction  to  partial  differential  equations 

Up  to  this  point,  the  differential  equations  that  w,e  have  studied  have  all 
been  relations  involving  one  or  more  functions  of  a  single  variable,  and 
their  derivatives.  In  this  sense,  these  differential  equations  are  ordinary  dif¬ 
ferential  equations.  On  the  other  hand,  many  important  problems  in  ap¬ 
plied  mathematics  give  rise  to  partial  differential  equations.  A  partial  dif¬ 
ferential  equation  is  a  relation  involving  one  or  more  functions  of  several 
variables,  and  their  partial  derivatives.  For  example,  the  equation 

9  3u  /  9w  \2  _  9 2u 
9x3  l  dt)  ~  dx2 

is  a  partial  differential  equation  for  the  function  u(x,t ),  and  the  equations 

du  _  dv_  9 u  __  dv 

dx  dy  ’  9 y  dx 

are  a  system  of  partial  differential  equations  for  the  two  functions  u(x,y) 
and  v(x,y).  The  order  of  a  partial  differential  equation  is  the  order  of  the 
highest  partial  derivative  that  appears  in  the  equation.  For  example,  the 
order  of  the  partial  differential  equation 

9 2u  ^  9 2u  , 

— 7=2-^-^-  +  u 
dt2  9x9 1 

is  two,  since  the  order  of  the  highest  partial  derivative  that  appears  in  this 
equation  is  two. 

There  are  three  classical  partial  differential  equations  of  order  two 
which  appear  quite  often  in  applications,  and  which  dominate  the  theory 
of  partial  differential  equations.  These  equations  are 


du  2  9  2u 

Zt~a  3*2’ 

(1) 

9  2u  2d  2u 
- =  c - 

(2) 

dt2  9x2 

9  2u  9  2u  ^ 

0x2  dy2  ' 

(3) 

Equation  (1)  is  known  as  the  heat  equation,  and  it  appears  in  the  study 
of  heat  conduction  and  other  diffusion  processes.  For  example,  consider  a 
thin  metal  bar  of  length  /  whose  surface  is  insulated.  Let  u(x9t)  denote  the 
temperature  in  the  bar  at  the  point  x  at  time  t.  This  function  satisfies  the 
partial  differential  equation  (1)  for  0<x</.  The  constant  a 2  is  known  as 
the  thermal  diffusivity  of  the  bar,  and  it  depends  solely  on  the  material 
from  which  the  bar  is  made. 
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Equation  (2)  is  known  as  the  wave  equation,  and  it  appears  in  the  study 
of  acoustic  waves,  water  waves  and  electromagnetic  waves.  Some  form  of 
this  equation,  or  a  generalization  of  it,  almost  invariably  arises  in  any 
mathematical  analysis  of  phenomena  involving  the  propagation  of  waves 
in  a  continuous  medium.  (We  will  gain  some  insight  into  why  this  is  so  in 
Section  5.7.)  The  wave  equation  also  appears  in  the  study  of  mechanical 
vibrations.  Suppose,  for  example,  that  an  elastic  string  of  length  /,  such  as 
a  violin  string  or  guy  wire,  is  set  in  motion  so  that  it  vibrates  in  a  vertical 
plane.  Let  u(x,t )  denote  the  vertical  displacement  of  the  string  at  the  point 
x  at  time  t  (see  Figure  1).  If  all  damping  effects,  such  as  air  resistance,  are 
negligible,  and  if  the  amplitude  of  the  motion  is  not  too  large,  then  u(x,t) 
will  satisfy  the  partial  differential  equation  (2)  on  the  interval  0  <  x  <  /.  In 
this  case,  the  constant  c 2  is  ///p,  where  H  is  the  horizontal  component  of 
the  tension  in  the  string,  and  p  is  the  mass  per  unit  length  of  the  string. 


Equation  (3)  is  known  as  Laplace’s  equation,  and  is  the  most  famous  of 
all  partial  differential  equations.  It  arises  in  the  study  of  such  diverse  ap¬ 
plications  as  steady  state  heat  flow,  vibrating  membranes,  and  electric  and 
gravitational  potentials.  For  this  reason,  Laplace’s  equation  is  often  re¬ 
ferred  to  as  the  potential  equation. 

In  addition  to  the  differential  equation  (1),  (2),  or  (3),  we  will  often  im¬ 
pose  initial  and  boundary  conditions  on  the  function  u.  These  conditions 
will  be  dictated  to  us  by  the  physical  and  biological  problems  themselves; 
they  will  be  chosen  so  as  to  guarantee  that  our  equation  has  a  unique  solu¬ 
tion. 

As  a  model  case  for  the  heat  equation  (1),  we  consider  a  thin  metal  bar 
of  length  /  whose  sides  are  insulated,  and  we  let  u(x9t)  denote  the  tempera¬ 
ture  in  the  bar  at  the  point  x  at  time  t.  In  order  to  dermine  the  temperature 
in  the  bar  at  any  time  t  we  need  to  know  (i)  the  initial  temperature  distrib¬ 
ution  in  the  bar,  and  (ii)  what  is  happening  at  the  ends  of  the  bar.  Are  they 
held  at  constant  temperatures,  say  0°C,  or  are  they  insulated,  so  that  no 
heat  can  pass  through  them?  (This  latter  condition  implies  that  ux(0,t)  = 
ux(l9t)  =  0.)  Thus,  a  “well  posed”  problem  for  diffusion  processes  is  the 


482 


5.3  The  heat  equation;  separation  of  variables 


heat  equation  (1),  together  with  the  initial  condition  w(x;,0)  =  f(x ),  0<x< 
/,  and  the  boundary  conditions  w(0,0  =  u(l9t)  =  0,  or  ux(0,t)  =  ux(l,t)  =  0. 

As  a  model  case  for  the  wave  equation,  we  consider  an  elastic  string  of 
length  /,  whose  ends  are  fixed,  and  which  is  set  in  motion  in  a  vertical 
plane.  In  order  to  determine  the  position  u(x,t)  of  the  string  at  any  time  t 
we  need  to  know  (i)  the  initial  position  of  the  string,  and  (ii)  the  initial 
velocity  of  the  string.  It  is  also  implicit  that  w(0,/)  =  w(/,f)  =  0.  Thus,  a  well 
posed  problem  for  wave  propagation  is  the  differential  equation  (2) 
together  with  the  initial  conditions  w(x,0)=/(x),  w/(jc,0)  =  g(x),  and  the 
boundary  conditions  u(0,t)=  w(/,/)=0. 

The  partial  differential  equation  (3)  does  not  contain  the  time  t,  so  that 
we  do  not  expect  any  “initial  conditions”  to  be  imposed  here.  In  the  prob¬ 
lems  that  arise  in  applications,  we  are  given  m,  or  its  normal  derivative,  on 
the  boundary  of  a  given  region  R ,  and  we  seek  to  determine  u(x,y)  inside 
R.  The  problem  of  finding  a  solution  of  Laplace’s  equation  which  takes  on 
given  boundary  values  is  known  as  a  Dirichlet  problem,  while  the  problem 
of  finding  a  solution  of  Laplace’s  equation  whose  normal  derivative  takes 
on  given  boundary  values  is  known  as  a  Neumann  problem. 

In  Section  5.3  we  will  develop  a  very  powerful  method,  known  as  the 
method  of  separation  of  variables,  for  solving  the  boundary-value  problem 
(strictly  speaking,  we  should  say  “initial  boundary-value  problem”) 

^  =  w(x,0)  =/(*),  0<x</;  w(0,/)  =  w(/,/)  =  0. 

vA 

After  developing  the  theory  of  Fourier  series  in  Sections  5.4  and  5.5,  we 
will  show  that  the  method  of  separation  of  variables  can  also  be  used  to 
solve  more  general  problems  of  heat  conduction,  and  several  important 
problems  of  wave  propagation  and  potential  theory. 

5.3  The  heat  equation;  separation  of  variables 

Consider  the  boundary-value  problem 

u(x,0)=f(x),  0  <x<l;  u(0,t)  =  u(l,t)  =  0.  (1) 

vA 

Our  goal  is  to  find  the  solution  u(x,t)  of  (1).  To  this  end,  it  is  helpful  to 
recall  how  we  solved  the  initial-value  problem 

-^+p{t)-^  +  q{t)y=Q\  y (0)  =/0>  y'(°)=y'o-  (2) 

First  we  showed  that  the  differential  equation 

^+p(t)^+q(t)y=0  (3) 
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is  linear;  that  is,  any  linear  combination  of  solutions  of  (3)  is  again  a  solu¬ 
tion  of  (3).  And  then,  we  found  the  solution  y(t)  of  (2)  by  taking  an  ap¬ 
propriate  linear  combination  cly1(t)  +  c2y2(0  of  two  linearly  independent 
solutions  yx(t)  and  y2(t)  of  (3).  Now,  it  is  easily  verified  that  any  linear 
combination  cxux(x,t)  +  ...  +cnun(x,t)  of  solutions  ux(x,t),...,un(x,t)  of 


3  u  _ 
dt  ~a 


2 


is  again  a  solution  of  (4).  In  addition,  if  ux(x,  t),  ...,un(x,  t)  satisfy  the 
boundary  conditions  w(0,/)  =  w(/,/)  =  0,  then  the  linear  combination  cxux 
+  ...  +  cnun  also  satisfies  these  boundary  conditions.  This  suggests  the 
following  “game  plan”  for  solving  the  boundary- value  problem  (1): 

(a)  Find  as  many  solutions  ux{x,  t),  u2(x,  t), . . .  as  we  can  of  the 
boundary-value  problem 


du 
3 1 


w(0,/)  =  w(/,/)  =  0. 


(b)  Find  the  solution  u(x,t)  of  (1)  by  taking  an  appropriate  linear  com¬ 
bination  of  the  functions  un(x,t),  n  =  1,2,... . 

(a)  Since  we  don’t  know,  as  yet,  how  to  solve  any  partial  differential 
equations,  we  must  reduce  the  problem  of  solving  (5)  to  that  of  solving  one 
or  more  ordinary  differential  equations.  This  is  accomplished  by  setting 
u(x,t)  =  X(x)T(t)  (hence  the  name  separation  of  variables).  Computing 


'~=XT'  and  ^ =X"T 
3'  dx2 

we  see  that  u(x,t)  =  X (x)T(t)  is  a  solution  of  the  equation  ut  =  a2uxx 
{ut  =  du/ 3/  and  uxx=d2u/ dx2)  if 


XT'  =  a2X"T. 


Dividing  both  sides  of  (6)  by  a2XT  gives 


(6) 

(7) 


Now,  observe  that  the  left-hand  side  of  (7)  is  a  function  of  x  alone,  while 
the  right-hand  side  of  (7)  is  a  function  of  t  alone.  This  implies  that 

^  =  -A,  and  (8) 

A  a  T 

for  some  constant  X.  (The  only  way  that  a  function  of  x  can  equal  a  func¬ 
tion  of  t  is  if  both  are  constant.  To  convince  yourself  of  this,  let  f(x)  =  g(t) 
and  fix  t0.  Then ,f(x)  =  g(t0)  for  all  x,  so  that  f(x)  =  constant  =  cx,  and  this 
immediately  implies  that  g(t)  also  equals  Cj.)  In  addition,  the  boundary 
conditions 


Q^u(Q,t)  =  X(Q)T(t), 
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and 

0=  w(/,  t)  =  X  (l)T(t) 

imply  that  Ar(0)  =  0  and  X(l)  =  0  (otherwise,  u  must  be  identically  zero). 
Thus,  u(x,t)  =  X(x)T(t)  is  a  solution  of  (5)  if 


X"  +XX  =  0\  X(0)  =  0,  Ar(/)  =  0  (9) 

and 

T'+\a2T=0.  (10) 

At  this  point,  the  constant  A  is  arbitrary.  However,  we  know  from  Example 
1  of  Section  5.1  that  the  boundary- value  problem  (9)  has  a  nontrivial  solu¬ 
tion  X (. x )  only  if  A  =  A„  =  n2 it1  / /2,  n  =  1,2,...;  and  in  this  case, 

X  (x)  =  Xn(x)  =  sin  — . 

Equation  (10),  in  turn,  implies  that 


T(t)=Tn(t)  =  e~a2n27r2t //2. 


(Actually,  we  should  multiply  both  Xn(x)  and  Tn(t)  by  constants;  however, 
we  omit  these  constants  here  since  we  will  soon  be  taking  linear  combina¬ 
tions  of  the  functions  Xn(x)Tn(t).)  Hence, 


x,/)  =  sm  — —  e 


tVJfX_  —  a2  n2  7T2 1  / 12 


is  a  nontrivial  solution  of  (5)  for  every  positive  integer  n. 

(b)  Suppose  that  f(x)  is  a  finite  linear  combination  of  the  functions 
sin  mx/l\  that  is, 

N 

ft  \  •  n^X 

/(-*)=  2j  Cn Sin  1  * 

n=\ 

Then, 

N 

/  .\  V  •  mx  —a2n27r2t/l2 

U(x9t)=  2j  CnSm-T~e 

n  =  \ 


is  the  desired  solution  of  (1),  since  it  is  a  linear  combination  of  solutions  of 
(5),  and  it  satisfies  the  initial  condition 

N 

u(x, 0)=  2)  cn sin^y-  =f(x),  0 <x<l. 

n=  1 

Unfortunately,  though,  most  functions  f(x)  cannot  be  expanded  as  a  finite 
linear  combination  of  the  functions  sin rnx/l,  n  —  1,2,...,  on  the  interval 
0  <  x  <  /.  This  leads  us  to  ask  the  following  question. 


Question :  Can  an  arbitrary  function  f(x)  be  written  as  an  infinite  linear 
combination  of  the  functions  sinwrx/7,  n=  1,2,...,  on  the  interval  0<x< 
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/?  In  other  words,  given  an  arbitrary  function  /,  can  we  find  constants 
such  that 


/./V  •  7TX  .  .  27 TX  . 

f(x)  —  cl  sin—  +  c2sin— - — I- 


...  =  2  c„sii 


mx 


0  <x<H 


n=  1 


Remarkably,  the  answer  to  this  question  is  yes,  as  we  show  in  Section  5.5. 

Example  1.  At  time  f  =  0,  the  temperature  m(x,0)  in  a  thin  copper  rod  (a2 
=  1.14)  of  length  one  is  2sin37r;t +  5sin877Jt,  0<  x  <  1.  The  ends  of  the  rod 
are  packed  in  ice,  so  as  to  maintain  them  at  0°C.  Find  the  temperature 
u(x,t)  in  the  rod  at  any  time  f>0. 

Solution.  The  temperature  u(x,t )  satisfies  the  boundary-value  problem 

w(x,0)  =  2sin37rx  +  5sin877\x,  0<x<  1 

w(0,  /)  =  w(l,/)  =  0 

and  this  implies  that 

u  ( x,  t)  =  2  sin  3  mx  e  ~  9( 1 • 1 4)77-2'  +  5  sin  8  mx  e  ~  1 ■ 1 4)’r\ 


Exercises 

Find  a  solution  u(x,t )  of  the  following  problems. 

1  w(x,0)  =  sin7rx/2  +  3sin57rx/2,  0<x<2 

’  "97”  *  1fo2’  w(0,/)=«(2,0=0 

2  9m__j^32w.  f  w(x,0)  =  sin7rx/2-3sin27rx,  0<x<2 

9*  dx2  ’  [  w(0,/)  =  w(2,/)  =  0 

3.  Use  the  method  of  separation  of  variables  to  solve  the  boundary-value  problem 

du  ___  d2u  w(x,0)  =  3sin27rx  — 7sin4T7\x:,  0<x<10 

9*  3jc2  ’  m(0,/)  =  w(10,0==0 

Use  the  method  of  separation  of  variables  to  solve  each  of  the  following 
boundary-value  problems. 

5.  =  u(t,0)  =  e~3'  +  e2' 

6.  u(Q,y)  =  2e~y  —  e2y 

7.  =  m;  w(/,0)  =  e_5'  +  2e~7'— 14e13' 
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8.  Determine  whether  the  method  of  separation  of  variables  can  be  used  to  re¬ 
place  each  of  the  following  partial  differential  equations  by  pairs  of  ordinary 
differential  equations.  If  so,  find  the  equations. 

(a)  tutt  +  ux  =  0  (b)  tuxx  4-  xut  =  0 

(c)  uxx  +  (x-y)uyy  =  0  (d)  uxx  +  2uxl  +  u,  =  0 

9.  The  heat  equation  in  two  space  dimensions  is 

W,  =  «2(w^  +  «yy)-  (*) 

(a)  Assuming  that  u(x,y,t)  =  X  (x)Y(y)T(t),  find  ordinary  differential  equa¬ 
tions  satisfied  by  X,  T,  and  T. 

(b)  Find  solutions  u(x,y,t)  of  (*)  which  satisfy  the  boundary  conditions 
w(0,y,0  =  0,  u(a,y,t)  =  Q,  u(x,0,t)  =  0,  and  u(x,b,t)  =  0. 

10.  The  heat  equation  in  two  space  dimensions  may  be  expressed  in  terms  of  polar 
coordinates  as 


ut  =  a2 


1  ,  1 

“rr  +  T  “r  +  “7*% 

r  r 


Assuming  that  u(r,9j)=R(r)S(9)T(t ),  find  ordinary  differential  equations 
satisfied  by  R ,  0,  and  T. 


5.4  Fourier  series 

On  December  21,  1807,  an  engineer  named  Joseph  Fourier  announced  to 
the  prestigious  French  Academy  of  Sciences  that  an  arbitrary  function  f(x) 
could  be  expanded  in  an  infinite  series  of  sines  and  cosines.  Specifically,  let 
/ (x)  be  defined  on  the  interval  -/<*</,  and  compute  the  numbers 


a„=  i  j  f(x)  cos  t  dx, 

n—  0, 1,2, . . . 

(i) 

and 

i  1  fl  rs  \  ■  mx 

bn=  1  J  f C*)sm  l 

dx , 

n  =  1,2, . . . 

(2) 

Then,  the  infinite  series 

a0  .  7TX  .  i  •  7TX  .  ao  , 

2  +  ci\  cos  ^  sm  +  . . .  =  ^  4" 

oo 

Z| 

n=\ 

r  yvhx  ,  ,  .  nmx  ' 

an  cos  — — h  bn  sin  — — 

(3) 

converges  to  f(x).  Fourier’s  announcement  caused  a  loud  furor  in  the 
Academy.  Many  of  its  prominent  members,  including  the  famous  mathe¬ 
matician  Lagrange,  thought  this  result  to  be  pure  nonsense,  since  at  that 
time  it  could  not  be  placed  on  a  rigorous  foundation.  However,  mathema¬ 
ticians  have  now  developed  the  theory  of  “Fourier  series”  to  such  an  extent 
that  whole  volumes  have  been  written  on  it.  (Just  recently,  in  fact,  they 
have  succeeded  in  establishing  exceedingly  sharp  conditions  for  the  Four¬ 
ier  series  (3)  to  converge.  This  result  ranks  as  one  of  the  great  mathemati- 
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cal  theorems  of  the  twentieth  century.)  The  following  theorem,  while  not 
the  most  general  theorem  possible,  covers  most  of  the  situations  that  arise 
in  applications. 

Theorem  2.  Let  f  and  f  be  piecewise  continuous  on  the  interval  —  /  <  x  <  /. 
( This  means  that  f  and  f  have  only  a  finite  number  of  discontinuities  on 
this  interval ,  and  both  f  and  f  have  right-  and  left-hand  limits  at  each 
point  of  discontinuity.)  Compute  the  numbers  an  and  bn  from  (1)  and  (2) 
and  form  the  infinite  series  (3).  This  series ,  which  is  called  the  Fourier 
series  for  f  on  the  interval  —  l  <  *  <  /,  converges  to  /(x)  if  f  is  continuous 
at  x ,  and  to  j[/(x +  0)+/(x  —  0)]*  if  f  is  discontinuous  at  x.  At  x  =  ±  /, 

the  Fourier  series  (3)  converges  to  \[f(l)  +/(  —  /)],  where  f(±l)  is  the 
limit  of  /(x)  as  x  approaches  ±  /. 

Remark.  The  quantity  |[/(x  +  0)+/(x-0)]  is  the  average  of  the  right- 
and  left-hand  limits  of  /  at  the  point  x.  If  we  define  /(x)  to  be  the  average 
of  the  right-  and  left-hand  limits  of  /  at  any  point  of  discontinuity  x ,  then 
the  Fourier  series  (3)  converges  to  /(x)  for  all  points  x  in  the  interval  -  /  < 
x<  l. 


Example  1.  Let  /  be  the,  function  which  is  0  for  -  1  <  x  <0  and  1  for  0  <  x 
<  1.  Compute  the  Fourier  series  for  /  on  the  interval  —  1  <  x  <  1. 

Solution.  In  this  problem,  /=  1.  Hence,  from  (1)  and  (2), 


a 


dx=  1, 


and 


an=  f  f (x) cos mrx dx  =  f  cos nwx dx  =  0,  n>l 

J-\  Jo 

bn—  f  f (x) sin mrxdx  =  f  sin mxdx 

J-\  Jo 


=  —  (1  —  cosmr)  = 

A277  mr 


n>  \. 


Notice  that  bn  =0  for  n  even,  and  bn  —  2/m  for  n  odd.  Hence,  the  Fourier 
series  for  /  on  the  interval  —  1  <  x  <  1  is 

1  .  2sin7rx  .  2sin37rx  .  2sin57rx  . 

- -j-  - - - h - - -  "r  . . .  . 

2  77  377  577 

By  Theorem  2,  this  series  converges  to  0  if  —  1  <x<0,  and  to  1  if  0<x<  1. 
At  x=  —  1,  0,  and  +  1,  this  series  reduces  to  the  single  number  \ ,  which  is 
the  value  predicted  for  it  by  Theorem  2. 


*The  quantity  /(*  +  0)  denotes  the  limit  from  the  right  of  /  at  the  point  x.  Similarly,  f(x  —  0) 
denotes  the  limit  of  /  from  the  left. 
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Example  2.  Let  /  be  the  function  which  is  1  for  —  2  <  x  <0  and  x  for  0  <  x 
<  2.  Compute  the  Fourier  series  for  /  on  the  interval  —  2  <  x  <  2. 

Solution .  In  this  problem  1=2.  Hence  from  (1)  and  (2), 


1 

2 


mrx 

2 


dx 


and 


(COS  WIT  —  1), 


.  mrx  , 
sin  —  dx 


1 


- (1  +COSAJ77-), 

mr 


1  . 
=  2UX 


n>  1. 


mrx  j  ,  1 
sin^—  dx+  — 


/„ 


•  mrx  i 
xsin-^—  dx 


Notice  that  an  =0  if  n  is  even;  an  =  —4/n27r2  if  n  is  odd;  bn  =  0  if  n  is  odd; 
and  b=—  2/ wr  if  w  is  even.  Hence,  the  Fourier  series  for  /  on  the  interval 
-2<x<2  is 


1 


4 

- -cos 

*rr2 


TTX 

2 


sin77-A:  — 


77 


4  37 rx  1  .  ^  , 

— -cos— - —  sin277X+  ... 

9^2  2  277 


00 


=  1 


7T 


2 


COS(2fl+  1)77x/2 


00 


„«o  (2/i+ iy 


«=1 


Sin^77X 

n 


(4) 


By  Theorem  2,  this  series  converges  to  1  if  —  2  <  x  <  0;  to  x,  if  0  <  x  <  2;  to 
\  if  x  =  0;  and  to  f  if  x±2.  Now,  at  x  =  0,  the  Fourier  series  (4)  is 


1- 


\+ 

32  52  72 


Thus,  we  deduce  the  remarkable  identity 


or 


i+i-+— +— + 

l2  32  52  72  8  ‘ 


The  Fourier  coefficients  an  and  bn  defined  by  (1)  and  (2)  can  be  derived 
in  a  simple  manner.  Indeed,  if  a  piecewise  continuous  function  /  can  be  ex¬ 
panded  in  a  series  of  sines  and  cosines  on  the  interval  —  l<x<  /,  then,  of 
necessity,  this  series  must  be  the  Fourier  series  (3).  We  prove  this  in  the 


489 


5  Separation  of  variables  and  Fourier  series 


following  manner.  Suppose  that  /  is  piecewise  continuous,  and  that 


/W=y  +  2 


oo 


knx  ,  J  .  kmX 

ck  cos— — hisin 


k=  1  L 


/ 


/ 


(5) 


for  some  numbers  ck  and  <4-  Equation  (5)  is  assumed  to  hold  at  all  but  a 
finite  number  of  points  in  the  interval  -/<*</.  Integrating  both  sides  of 

(5)  between  -/  and  /  gives  cQl=  J  ^  f(x)dx ,  since 


J  cos dx=  j  sin-^y^  rfx  =  0; 


&=  1,2,... 


Similarly,  multiplying  both  sides  of  (5)  by  cosmx/l  and  integrating  be¬ 
tween  —  /  and  /  gives 


(  /(*) 

J-i 


nmx  , 
cos— y-  ax; 


while  multiplying  both  sides  of  (5)  by  sin  rmx /l  and  integrating  between 
—  /  and  /  gives 


ldn  =  j  /(x)sin-^y^  dx. 


This  follows  immediately  from  the  relations  (see  Exercise  19) 


/' 

J-i 

f 


nmx  kmx  , 
cos — : —  cos — —  dx  = 


l 


nmx  • 
cos — -sin 


/ 


/ 

kmx 

l 


0, 

u 


k=£n 

k~n 


dx  =  0 


(6) 


(7) 


and 

r/sin«^sin^^=( o,  (8) 

JL  /  l  /  l  /,  k  =  n 

Hence,  the  coefficients  cn  and  dn  must  equal  the  Fourier  coefficients  an  and 
bn.  In  particular,  therefore,  a  function  /  can  be  expanded  in  one,  and  only 
one,  Fourier  series  on  the  interval  —  l <x<L 


Example  3.  Find  the  Fourier  series  for  the  function  f{x)  —  cos2*  on  the  in¬ 
terval  —  m  <  x  <  m. 

Solution.  By  the  preceding  remark,  the  function  f(x)  =  cos2x  has  a  unique 
Fourier  series 


00 

2  [  °n  COS 
n=  \  L 


nmx 

m 


+  bn  sin 


nmx 

m 


ao 


2 


00 

+  2  [a„cosnx  +  bnsinnx] 

n  —  \ 


*It  can  be  shown  that  it  is  permissible  to  integrate  the  series  (5)  term  by  term. 
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on  the  interval  —  m  <  a  <  m.  But  we  already  know  that 


cos2a  = 


.  cos  2  a: 


Hence,  the  Fourier  series  for  cos2  a;  on  the  interval  —  7r  <  a  <  7r  must  be 
5  +  ^cos2a. 


The  functions  cos mrx/ 1  and  sinmrx/l,  n  — 1,2,...  all  have  the  interest¬ 
ing  property  that  they  are  periodic  with  period  21;  that  is,  they  repeat 
themselves  over  every  interval  of  length  21.  This  follows  trivially  from  the 
identities 

m  ,  .  ~7>.  /  mrx  ,  ~  \  mrx 

cos  —  ( x  +  21 )  =  cos^  — — h  2mr  J  =  cos  —j~ 

and 

.  m ,  .  .  /  mrx  t  ~  \  .  mrx 

sin — ( x  +  21 )  =  sin^  -y-  +  2m  J  =  sin  — y- . 

Hence,  the  Fourier  series  (3)  converges  for  all  x  to  a  periodic  function 
F(x).  This  function  is  called  the  periodic  extension  of  /(a).  It  is  defined  by 
the  equations 


F(a)=/(a),  /<  a  <  / 

•  F(x)  =  \[f(l)+f(-l)],  x=±l 
F(a  +  2/)  =  F(a). 

For  example,  the  periodic  extension  of  the  function  /( x)  =  x  is  described  in 
Figure  1,  and  the  periodic  extension  of  the  function  /(a)  =  |x|  is  the  saw¬ 
toothed  function  described  in  Figure  2. 
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Exercises 

In  each  of  Problems  1-13,  find  the  Fourier  series  for  the  given  function/ 
on  the  prescribed  interval. 


-1  <x<0. 
0  <  x  <  1  ’ 


x  <\ 


i.  /<*>-  { - !; 

1 '<*>-{$  w<2 

3.f(x)  =  x;  —  1  <  x  <  1 


4.  /(*)=[ 


=  1  -X, 
X, 


~\<x<0-,  W<1 

0<  x  <  1 


2  <  x<0 

o  <  jc  <  l ;  |x|<2 

1  <  x  <  2 


w<2 

w</ 


9.  f(x)- 


—  X 


l ex9 


—  I  <  x  <0 . 
0<  x  <  /’ 


/<  x  <  / 


11.  f(x)  =  e  x ;  \x\  </ 

13. /(x)  =  sin3x;  |x|<?r 


10.  f(x)=ex;  \x\  <  / 

12.  /(x)  =  sin2x;  |x|  <  7r 

14.  Let  /(x)  =  (7rcosa;t)/2asintf7r,  a  not  an  integer. 

(a)  Find  the  Fourier  series  for  /  on  the  interval  —  m  <  x  <  it. 

(b)  Show  that  this  series  converges  at  x  =  tt  to  the  value  (tt /2a)  cot  ira. 

(c)  Use  this  result  to  sum  the  series 


1 


l2-<r 


+ 


1 


22 


+ 


1 


a‘ 


32  —  a: 


« *  •  * 


15.  Suppose  that  /  and  /'  are  piecewise  continuous  on  the  interval  —  l  <x  <1.  Show 
that  the  Fourier  coefficients  an  and  bn  approach  zero  as  n  approaches  infinity. 

16.  Let 

00 

\  a°  .  r  nvx  ,  u  •  nmx  1 

f(x)=  —  +  2_,  [a„cos  —  +  bnsm—  J. 

n=\ 

Show  that 

I  fl  f2(x)dx=Y+  2  (««+*»)• 

~  l  n  —  1 

This  relation  is  known  as  Parseval’s  identity.  Hint:  Square  the  Fourier  series 
for  /  and  integrate  term  by  term. 
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17.  (a)  Find  the  Fourier  series  for  the  function f(x)  =  x2  on  the  interval  —tt<x< 


IT. 


(b)  Use  Parseval’s  identity  to  show  that 


±  +  ±  +  _U 

I4  24  34  90 


18.  If  the  Dirac  delta  function  8  (x)  had  a  Fourier  series  on  the  interval  -  i  <  x  <  /, 
what  would  it  be? 

19.  Derive  Equations  (6)— (8).  Hint:  Use  the  trigonometric  identities 

sin>l  cos 5  =  ^[sinC4  +  B)  +  sin(A  -  £)] 
sin^4  sin  5  =  £[cos(yl  —  B  )  —  cos04  +  B  )] 
cosv4cosJ?=  £[cosC4  +  Z?)  +  cosC4  —  5)]. 


5.5  Even  and  odd  functions 

There  are  certain  special  cases  when  the  Fourier  series  of  a  function  /  re¬ 
duces  to  a  pure  cosine  or  a  pure  sine  series.  These  special  cases  occur  when 
/  is  even  or  odd. 

Definition.  A  function  /  is  said  to  be  even  if  /( —  x)  =  / (x). 

Example  1.  The  function  f(x)  =  x2  is  even  since 

/(-*)  =  (- x)2  =  x2  =/(*). 

Example  2.  The  function  f(x)  =  cosmrx/ 1  is  even  since 

r,  v  —  mrx  mrx  £,  x 
/(  -  x)  =  COS  — - —  =  cos -J-  =/(*). 

Definition.  A  function/  is  said  to  be  odd  if  /(-  x)=  —f(x). 

Example  3.  The  function  f(x)  =  x  is  odd  since 

f(-x)=-x=-f(x). 

Example  4.  The  function  f(x)  =  sin  nirx/l  is  odd  since 

/( -  x)  =  sin  — - —  =  -  sin  —  =  -f(x). 

Even  and  odd  functions  satisfy  the  following  elementary  properties. 

1.  The  product  of  two  even  functions  is  even. 

2.  The  product  of  two  odd  functions  is  even. 

3.  The  product  of  an  odd  function  with  an  even  function  is  odd. 

The  proofs  of  these  assertions  are  trivial  and  follow  immediately  from  the 
definitions.  For  example,  let /  and  g  be  odd  and  let  h(x)=f(x)g(x).  This 
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function  h  is  even  since 

h(-x)=f(-x)g(-x)=[  -f(x)  ][-£(*)]  =f(x)  g(x)  =  h  (x). 

In  addition  to  the  multiplicative  properties  1-3,  even  and  odd  functions 
satisfy  the  following  integral  properties. 

4.  The  integral  of  an  odd  function  /  over  a  symmetric  interval  [  —  /,/]  is 
zero;  that  is,  j1  ^f(x)dx  =  0  if  /  is  odd. 

5.  The  integral  of  an  even  function  /  over  the  interval  [  —  /,/]  is  twice  the 
integral  of  /  over  the  interval  [0, /];  that  is, 


if  /  is  even. 


j  f(x)dx  =  2  J  f(x)dx 


Proof  of  property  4.  If  /  is  odd,  then  the  area  under  the  curve  of  /  be¬ 
tween  —  /  and  0  is  the  negative  of  the  area  under  the  curve  of  /  between  0 

and  /.  Hence,  j 1  /( x)  dx  =  0  if  /  is  odd.  □ 

Proof  of  property  5.  If  /  is  even,  then  the  area  under  the  curve  of  /  be¬ 
tween  -  /  and  0  equals  the  area  under  the  curve  of  /  between  0  and  /. 
Hence, 

f  f(x)dx  = 

if  /  is  even.  □ 

Concerning  even  and  odd  functions,  we  have  the  following  important 
lemma. 


Lemma  1. 

(a)  The  Fourier  series  for  an  even  function  is  a  pure  cosine  series ;  that 
is ,  it  contains  no  terms  of  the  form  sin  mrx/L 

(b)  The  Fourier  series  for  an  odd  function  is  a  pure  sine  series ;  that  is, 
it  contains  no  terms  of  the  form  cos  mrx/l. 

Proof,  (a)  If  /  is  even,  then  the  function  f(x)smmrx/ 1  is  odd.  Thus,  by 
Property  4,  the  coefficients 

bn:='jf  f(x)sin^Y^-dx,  h  =  1,2,3,... 

in  the  Fourier  series  for  /  are  all  zero. 

(b)  If  /  is  odd,  then  the  function  f(x)cosmrx/l  is  also  odd.  Conse¬ 
quently,  by  Property  4,  the  coefficients 

an~fj  f(x)C0S^j^dx,  72=0,1,2,... 

in  the  Fourier  series  for  /  are  all  zero.  □ 
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We  are  now  in  a  position  to  prove  the  following  extremely  important  ex¬ 
tension  of  Theorem  2.  This  theorem  will  enable  us  to  solve  the  heat  con¬ 
duction  problem  of  Section  5.3  and  many  other  boundary-value  problems 
that  arise  in  applications. 


Theorem  3.  Let  f  and  f  be  piecewise  continuous  on  the  interval  0  <  x  <  /. 
Then ,  on  this  interval ,  f(x)  can  be  expanded  in  either  a  pure  cosine  series 


a 


o 


2 


,  V  mx 

+  tfnCOS— — 


or  a  pure  sine  series 


00 


2  &„sin 


i 


mx 

l 


In  the  former  case ,  the  coefficients  an  are  given  by  the  formula 

an  =  jf  f(x)cos^~~dx,  n  =  0,1,2,... 

/  J0  l 

while  in  the  latter  case ,  the  coefficients  bn  are  given  by  the  formula 


n 


l 


f/(x) 

Jo 


nrrx  , 
sin  — r—  ax. 


I 


(1) 

(2) 


Proof.  Consider  first  the  function 


fW=  /(4  o<x<i 

\f(-x),  -l<x<  O' 

The  graph  of  F(x)  is  described  in  Figure  1,  and  it  is  easily  seen  that  F  is 
even.  (For  this  reason,  F  is  called  the  even  extension  of  /.)  Hence,  by 
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Lemma  1,  the  Fourier  series  for  F  on  the  interval  —  /  <  x  <  /  is 

00  , 

F(x)=y  +  2  «„COSy^;  an=lj  F(X)C0S^Y'dx-  (3) 
Now,  observe  that  the  function  F{x)cosmrx/ 1  is  even.  Thus,  by  Property  5 


Finally,  since  F(x)  =  f(x),  0  <  x  <  /,  we  conclude  from  (3)  that 

00 

/(*)=  y  +  2ll  anCOS—>  0  <*</. 

n=  1 

Observe  too,  that  the  series  (3)  converges  to  /(x)  for  x  =  0  and  x  —  L 
To  show  that  f{x)  can  also  be  expanded  in  a  pure  sine  series,  we  con¬ 
sider  the  function 


G(x) 


/(*). 
.  0, 


0<x</ 
— /<x<0 
x  =  0,  ±  /. 


The  graph  of  G(x)  is  described  in  Figure  2,  and  it  is  easily  seen  that  G  is 
odd.  (For  this  reason,  G  is  called  the  odd  extension  of  /.)  Hence,  by 
Lemma  1,  the  Fourier  series  for  G  on  the  interval  —  /  <  x  <  l  is 


00 


GW=  2  *>„sin^y ; 

n=l 


•  nmx  i 
sm— —  ax. 
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Now,  observe  that  the  function  G(x)  smmrx/l  is  even.  Thus,  by  Property 

5, 


Finally,  since  G(x)  =  f(x),0<x<l,  we  conclude  from  (4)  that 

00 

f(X)=^lbnsin~ JL>  0  <X<1. 

n-1 


Observe  too,  that  the  series  (4)  is  zero  for  x  =  0  and  x  =  /. 


Example  5.  Expand  the  function  / (x)  =  1  in  a  pure  sine  series  on  the  inter¬ 
val  0  <  x  <  ir. 

Solution .  By  Theorem  3 ,/(*)=  2  7=\bnsinnx>  where 


Hence, 


2 
7 T 


IT 

sin  nxdx  — 


r 


L  wn 


n  even 
n  odd. 


sinx 

1 


sin  3.x 
3 


sin  5.x 
5 


0<X<7T. 


Example  6.  Expand  the  function  f(x)  =  ex  in  a  pure  cosine  series  on  the  in¬ 
terval  0<  *  <  1. 

Solution .  By  Theorem  3,/(x)  =  a0/2+  2T=i ancosn7TX>  where 


Hence, 


=  2  Re 


[  (ecoswr—  1)(1  —  imr)  ] 
\  +  n2ir2 


2(ecos  mr—  1) 
1  H-/22tt2 


00 


ex  =  e—  1+2^ 


n=\ 


(ecos  mr—  1) 
1  +  n2  7r2 


cos  mrx,  0  <  x  <  1 . 
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Exercises 


Expand  each  of  the  following  functions  in  a  Fourier  cosine  series  on  the 
prescribed  interval. 


1.  /(*)=  e  x;  0<  x  <  1 


!•/(*)- {5’  o<*<2 


0<  x <  1 . 
1  <x<2’ 


3.  f(x)-  f  0 <  *<a  ;  0<x<2a 

la,  a  <  x  <  2a 

4.  /(*)==  cos2*;  0  <  a:  <  77 


Expand  each  of  the  following  functions  in  a  Fourier  sine  series  on  the  pre¬ 
scribed  interval. 


6.  f(x)=e  *;  0< x<  1 


0<x  < 1 . 
1  <x<2’ 


8 


a, 


0 <x<a  . 
a  <  x  <2a  ’ 


0<  x<2a 


9. /(x)  =  2sinxcosx;  0<x<tt 


10.  /(*)- 


0<x  <  //2 1 

i/i<x<r 


0<x  <  / 


0<x<2 


11.  (a)  Expand  the  function  /(*)  =  sin;c  in  a  Fourier  cosine  series  on  the  interval 
0  <  X  <  7T. 

(b)  Expand  the  function /(*)  =  cos*  in  a  Fourier  sine  series  on  the  interval  0< 

X  <  TT. 

(c)  Can  you  expand  the  function  /(x)  =  sin  x  in  a  Fourier  cosine  series  on  the 
interval  —  tt  <  x  <  tt?  Explain. 


5.6  Return  to  the  heat  equation 

We  return  now  to  the  boundary- value  problem 

w(x,0)=/(x) 

[  w(0,/)  =  w(/,/)  =  0 


9  u  ?92w 


9 1 


=  a2 


9x 


2  ’ 


We  showed  in  Section  5.3  that  the  function 

00 

/  ^  V  .  mx  e-a2n2ir2t/l 2 

U{x,t)=  2j  Cn Sin  1  “ 

n  =  1 

is  (formally)  a  solution  of  the  boundary-value  problem 


9  u  2<$2u 
—  =  a  - - 


u(0,t)  =  u(l ,  /)  =  0 


(i) 


(2) 


3/  ~  ax2  ’ 

for  any  choice  of  constants  cvc2,... .  This  led  us  to  ask  whether  we  can 
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find  constants  cl9c2,..>  such  that 

00 

u (x, 0)  =  2  c„ sin =f(x),  0<x<l.  (3) 

n-  1 

As  we  showed  in  Section  5.5,  the  answer  to  this  question  is  yes;  if  we 
choose 


2  fl  r,  \  •  mrx  , 

Cn=~j  J  f(x)  sin  — dx, 

then  the  Fourier  series  2T-i cnsinmrx/l  converges  to  f(x)  if  / is  continu¬ 
ous  at  the  point  x .  Thus, 


00 


n—\  L 


/ 


u(xyt)  =  -y  2  f  f(x)  sin ^—~dx 
l  -  I  •'O 


/ 


si 


is  the  desired  solution  of  (1). 


Remark.  Strictly  speaking,  the  solution  (4)  cannot  be  regarded  as  the  solu¬ 
tion  of  (1)  until  we  rigorously  justify  all  the  limiting  processes  involved. 
Specifically,  we  must  verify  that  the  function  u(x,t)  defined  by  (4)  actually 
has  partial  derivatives  with  respect  to  x  and  /,  and  that  u(x,t )  satisfies  the 
heat  equation  ut  =  a2uxx.  (It  is  not  true,  necessarily,  that  an  infinite  sum  of 
solutions  of  a  linear  differential  equation  is  again  a  solution.  Indeed,  an 
infinite  sum  of  solutions  of  a  given  differential  equation  need  not  even  be 
differentiable.)  However,  in  the  case  of  (4)  it  is  possible  to  show  (see  Ex¬ 
ercise  3)  that  u(xyt)  has  partial  derivatives  with  respect  to  x  and  t  of  all 
orders,  and  that  u(x,t )  satisfies  the  boundary-value  problem  (1).  The  argu¬ 
ment  rests  heavily  upon  the  fact  that  the  infinite  series  (4)  converges  very 
rapidly,  due  to  the  presence  of  the  factor  e-«2n2"2t/l\  Indeed,  the  function 
u(x,t ),  for  fixed  f>0,  is  even  analytic  for  0<x</.  Thus,  heat  conduction 
is  a  diffusive  process  which  instantly  smooths  out  any  discontinuities  that 
may  be  present  in  the  initial  temperature  distribution  in  the  rod.  Finally, 
we  observe  that  lim^oow(x,/)  =  0,  for  all  x ,  regardless  of  the  initial  temper¬ 
ature  in  the  rod.  This  is  in  accord  with  our  physical  intuition  that  the  heat 
distribution  in  the  rod  should  ultimately  approach  a  “steady  state”;  that  is, 
a  state  in  which  the  temperature  does  not  change  with  time. 

Example  1.  A  thin  aluminum  bar  (a2  =  0.86  cm2/s)  10  cm  long  is  heated  to 
a  uniform  temperature  of  100°C.  At  time  f  =  0,  the  ends  of  the  bar  are 
plunged  into  an  ice  bath  at  0°C,  and  thereafter  they  are  maintained  at  this 
temperature.  No  heat  is  allowed  to  escape  through  the  lateral  surface  of  the 
bar.  Find  an  expression  for  the  temperature  at  any  point  in  the  bar  at  any 
later  time  t. 
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Solution .  Let  u{x,t)  denote  the  temperature  in  the  bar  at  the  point  x  at 
time  t.  This  function  satisfies  the  boundary-value  problem 

to-OM&L;  H*-0)-100,  0<*<l0  (5) 

dt  dx2  [  u(0,t)  =  w(10,/)  =  0 

The  solution  of  (5)  is 

u(x,t)  =  2  cBsin^^-°-86nV'/100 

n  =  1 

where 


10 


1  nn  •  Mr*  J  200  f ,  v 

100sm-r^-«*  = - (1  —  coswr). 


10 


mr 


Notice  that  cw  =  0  if  n  is  even,  and  c/J  =  400/wr  if  n  is  odd.  Hence, 


7 T 

n=\ 


oo  sin(2«  + 1) 


7 IX 
10 


-0.86(2/H-  l)Vf/ 100 

w  • 


(2rt  + 1) 


There  are  several  other  problems  of  heat  conduction  which  can  be 
solved  by  the  method  of  separation  of  variables.  Example  2  below  treats 
the  case  where  the  ends  of  the  bar  are  also  insulated,  and  Exercise  4  treats 
the  case  where  the  ends  of  the  bar  are  kept  at  constant,  but  nonzero  tem¬ 
peratures  Tx  and  T2. 

Example  2.  Consider  a  thin  metal  rod  of  length  /  and  thermal  diffusivity 
a2,  whose  sides  and  ends  are  insulated  so  that  there  is  no  passage  of  heat 
through  them.  Let  the  initial  temperature  distribution  in  the  rod  be  / (x). 
Find  the  temperature  distribution  in  the  rod  at  any  later  time  t . 

Solution.  Let  u(x,t)  denote  the  temperature  in  the  rod  at  the  point  x  at 
time  t.  This  function  satisfies  the  boundary-value  problem 

du  _  2  32w 

a*2; 

We  solve  this  problem  in  two  steps.  First,  we  will  find  infinitely  many  solu¬ 
tions  un{x,t)  —  Xn(x)Tn(t)  of  the  boundary- value  problem 

ut  =  a2uxx;  ux(0,t)  =  ux(l,t)=0,  (7) 

and  then  we  will  find  constants  c0,clfc2,...  such  that 

00 

U(x,t)=  2  WniX’t) 

n  —  0 

satisfies  the  initial  condition  u(x,0)=f(x). 
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u(x,0)=f(x),  0<*</ 

ux(0,t)  =  ux(l,t)  =  0 


(6) 


5.6  Return  to  the  heat  equation 


Step  1:  Let  u(x,t)  =  X  (x)T  (t).  Computing 

^-=XT'  and  ^ =X"T 
3 1  9*2 

we  see  that  u(x,t )  is  a  solution  of  ut  =  a2uxx  if 

XT'  =  ol2X"T,  or  ^  =  (8) 

X  a2T 

As  we  showed  in  Section  5.3,  Equation  (8)  implies  that 

X"+XX=0  and  T'  +  Xol2T= 0 
for  some  constant  X.  In  addition,  the  boundary  conditions 

0=ux(0,t)  =  X'(0)T(t )  and  0  =  ux(l,t)  =  X'(l)T(t) 

imply  that  Ar'(0)  =  0  and  X'(l)  =  0.  Hence  u(x,t)  =  X(x)T(t)  is  a  solution 
of  (7)  if 

X”+XX  =  0;  X'(0)  =  09  X'(l)= 0  (9) 

and 

r+\a2r=*o.  (10) 

At  this  point,  the  constant  X  is  arbitrary.  However,  the  boundary-value 
problem  (9)  has  a  nontrivial  solution  X(x)  (see  Exercise  1,  Section  5.1) 
only  if  X  =  n2 m2 / /2,  n  =  0, 1, 2, ... ,  and  in  this  case 

X  (x)  =  Xn  (x)  =  cos  ~~y~  • 

Equation  (10),  in  turn,  implies  that  T(t)  =  e~a2n27r2t //2.  Hence, 


un(x,t)  =  cos 


mx 

l 


g  —  a2  n2  it2  t  / 12 


is  a  solution  of  (7)  for  every  nonnegative  integer  n. 
Step  2:  Observe  that  the  linear  combination 


00 


M(^,0  =  y+Ec«cosyg  aW'//2 

n  —  1 


is  a  solution  (formally)  of  (7)  for  every  choice  of  constants  c0,cvc29... 
initial  value  is 


.  Its 


r  00 

(  cw  0  .  V  nnx 

U(X,0)=  —  +  2j  Cn  C°S  J  • 

n  =  1 

Thus,  in  order  to  satisfy  the  initial  condition  w(x,0)=  f(x)  we  must  choose 
constants  c0,  cv  c2, . . .  such  that 

/(*)=  y  +  2  cncos^7^>  o <X<1. 

n=l 

In  other  words,  we  must  expand /in  a  Fourier  cosine  series  on  the  interval 
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0  <  x  <  /.  This  is  precisely  the  situation  in  Theorem  3  of  Section  5.5,  and 
we  conclude,  therefore,  that 


Hence, 


n 


dx. 


00  r 

u(x,t)=jf^f(x)dx+  jf 


/  (x)  cos  ^j^-dx 


cos^e“aW'//2  (11) 


is  the  desired  solution  of  (6). 


Remark.  Observe  from  (11)  that  the  temperature  in  the  rod  ultimately  ap¬ 
proaches  the  steady  state  temperature 

1  rl 

This  steady  state  temperature  can  be  interpreted  as  the  “average”  of  the 
initial  temperature  distribution  in  the  rod. 


Exercises 


(c)  «(x,0)  = 

(d)  w(x,  0)  = 


1.  The  ends  x  =  0  and  x  =  10  of  a  thin  aluminum  bar  (a2  =  0.86)  are  kept  at  0°C, 
while  the  surface  of  the  bar  is  insulated.  Find  an  expression  for  the  temperature 
u(x,t)  in  the  bar  if  initially 

(a)  w(x,0)  =  70,  0<x<10 

(b)  m(x,0)  =  70cosx,  0<x<10 

f  lOx,  0<x<5 

10(10  — x),  5  <  x<  10 

0,  0<  x  <3 

65,  3<x<10 

2.  The  ends  and  sides  of  a  thin  copper  bar  (a2  =  1.14)  of  length  2  are  insulated  so 
that  no  heat  can  pass  through  them.  Find  the  temperature  u(x,t)  in  the  bar  if 
initially 

(a)  w(x,0)  =  65cos27tx,  0<x<2 

(b)  «(x,0)  =  70sinx,  0<x<2 

60x,  0  <  x  <  1 

60(2  —  x),  1  <  x  <  2 

0,  0  <  x  <  1 

75,  l<x<2 


(c)  «(x,0)  = 

(d)  w(x,0)=  | 


3.  Verify  that  the  function  u{x,t)  defined  by  (4)  satisfies  the  heat  equation.  Hint : 
Use  the  Cauchy  ratio  test  to  show  that  the  infinite  series  (4)  can  be  differenti¬ 
ated  term  by  term  with  respect  to  x  and  t. 


4.  A  steady  state  solution  u(x,t )  of  the  heat  equation  ut  =  a2 uxx  is  a  solution  u(x,t) 
which  does  not  change  with  time. 

(a)  Show  that  all  steady  state  solutions  of  the  heat  equation  are  linear  functions 
of  x;  i.e.,  u(x)  =  Ax  +  B. 
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(b)  Find  a  steady  state  solution  of  the  boundary-value  problem 

ut  =  a2uxx\  m(0,/)=71,  u(l,t)  =  T2. 

(c)  Solve  the  heat  conduction  problem 

2  (  w(x,0)  =  75,  0<x<l 

*  |m(0,/)=20,  w(1,/)  =  60 

Hint:  Let  u(x,t)  =  v(x)  +  w(x9t)  where  t>(x)  is  the  steady  state  solution  of 
the  boundary-value  problem  ut  =  a2uxx;  u(0, t)  —  20,  w(l,/)  =  60. 

5.  (a)  The  ends  of  a  copper  rod  (a2=  1.14)  10  cm  long  are  maintained  at  0°C, 
while  the  center  of  the  rod  is  maintained  at  100°C  by  an  external  heat  source. 
Show  that  the  temperature  in  the  rod  will  ultimately  approach  a  steady  state  dis¬ 
tribution  regardless  of  the  initial  temperature  in  the  rod.  Hint:  Split  this  problem 
into  two  boundary-value  problems. 

(b)  Assume  that  the  temperature  in  the  rod  is  at  its  steady  state  distribution.  At 
time  /  =  0,  the  external  heat  source  is  removed  from  the  center  of  the  rod,  and 
placed  at  the  left  end  of  the  rod.  Find  the  temperature  in  the  rod  at  any  later 
time  t. 


6.  Solve  the  boundary-value  problem 


ut  =  uxx  +  u; 


[  w(x,0)  =  cosx, 


[  w(0,  /)  =  0, 


0<  jc<  1 
u(\,t)  =  0. 


5.7  The  wave  equation 

We  consider  now  the  boundary- value  problem 

d2u  _  2^2u  . 
a t2  c  dx2  ’ 

which  characterizes  the  propagation  of  waves  in  various  media,  and  the 
mechanical  vibrations  of  an  elastic  string.  This  problem,  too,  can  be  solved 
by  the  method  of  separation  of  variables.  Specifically,  we  will  (a)  find  solu¬ 
tions  un(x,t)  =  Xn(x)Tn(t)  of  the  boundary-value  problem 


u  (x,  0)  =/(*),  ut  (x,  0)  =  g(x) 

\  u(09t)  =  u(l,t)  =  0 


(1) 


u(0,t)  =  u(l9t)  =  0 


and  (b)  find  the  solution  u(x,t)  of  (1)  by  taking  a  suitable  linear  combina¬ 
tion  of  the  functions  un(x9t). 

(a)  Let  u(x9  t)  =  X (x)  T ( t ).  Computing 

=  XT"  and  ^  =X"T 
dt2  dx2 

we  see  that  u(x,t)  =  X(x)T(t)  is  a  solution  of  the  wave  equation  w„  =  c2 uxx 
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if  XT"  =  c2X"T,  or 


T"  X" 


c2T  X  •  (3) 

Next,  we  observe  that  the  left-hand  side  of  (3)  is  a  function  of  t  alone, 
while  the  right-hand  side  is  a  function  of  x  alone.  This  implies  that 

T"  .  X" 


c2T 


for  some  constant  A.  In  addition,  the  boundary  conditions 

0=M(0,r)  =  Ar(0)T(r),  and  0  =  u(l,t)  =  X (l)T(t) 

imply  that  X (0)  =  0  and  X (/)  =  0.  Hence  u(x,t)  =  X (x) T (t)  is  a  solution  of 
(2)  if 

.r'+AA^O;  *(0)  =  A(/)=0  (4) 

and 

T"  +\c2T =0.  (5) 

At  this  point,  the  constant  A  is  arbitrary.  However,  the  boundary-value 
problem  (4)  has  a  nontrivial  solution  X(x)  only  if  X=Xn  =  n2,n2/ 12,  and  in 
this  case, 

Ar(x)  =  A„(x)  =  sin^. 

Equation  (5),  in  turn,  implies  that 


rji  /  rri  /  .\  WITCt  ,  »  •  YVTTCt 

T(t)  =  Tn(t)  =  an  cos — r—  +6.  sin 


/ 


n 


l 


Hence, 


mx 


un(x,t)  =  sin— 


met  ,  ,  .  riTTCt 

a„  cos — : — h  sin 


wn 


l 


n 


l 


is  a  nontrivial  solution  of  (2)  for  every  positive  integer  n,  and  every  pair  of 
constants  an,bn. 

(b)  The  linear  combination 


00 


/  V'  .  mx 
u(x,t)=  sm“J“ 

n  =  \ 


met  ,  ,  .  met 

a„  cos — = — I-  b„  sin 


/ 


/ 


formally  satisfies  the  boundary-value  problem  (2)  and  the  initial  conditions 

00  00 

/  m  V  mx  a  /  V  tfmc ,  •  mx 

w(x,0)  =  2d  fl„sin— y-  and  ut(x, 0)=  2j  -y-^sin-y-. 


n—  \ 


n=\ 


Thus,  to  satisfy  the  initial  conditions  u(x,0)=f(x)  and  ut(x,0)  =  g(x ),  we 
must  choose  the  constants  a„  and  b„  such  that 


00 


00 


/v  x  V'  •  mx  j  /  \  »  •  aittjc 

/(*)  =  2jansm—  and  «W=  Zi  ~T~b„  Sin— y 

n  —  1  n  =  1 


504 


5.7  The  wave  equation 


on  the  interval  0  <  *  <  /.  In  other  words,  we  must  expand  the  functions 
f(x)  and  g(x )  in  Fourier  sine  series  on  the  interval  0<x</.  This  is  pre¬ 
cisely  the  situation  in  Theorem  3  of  Section  5.5,  and  we  conclude,  there¬ 
fore,  that 


•  nmx  j  i 
sm—j-dx  and 


K  = 


nmc 


f  g(x)* 


nmx  , 
sm — i—dx. 


I 


For  simplicity,  we  now  restrict  ourselves  to  the  case  where  g{x)  is  zero; 
that  is,  the  string  is  released  with  zero  initial  velocity.  In  this  case  the  dis¬ 
placement  u(x,t)  of  the  string  at  any  time  /  >  0  is  given  by  the  formula 


00 


u(x,t)=  ^  tf„sin 


nmx  nmct 
cos — — 


n=  1 


/ 


2  fl  r,  \  •  nmx  , 
an=j  J  f(x)sm—dx. 


(6) 


There  is  a  physical  significance  to  the  various  terms  in  (6).  Each  term  rep¬ 
resents  a  particular  mode  in  which  the  string  vibrates.  The  first  term  (n  =  1) 
represents  the  first  mode  of  vibration  in  which  the  string  oscillates  about 
its  equilibrium  position  with  frequency 


1  me  c  i  , 

=  - - —  =  —  cycles  per  second. 

7T  l  2 1 


This  lowest  frequency  is  called  the  fundamental  frequency,  or  first  har¬ 
monic  of  the  string.  Similarly,  the  nth  mode  has  a  frequency 


to 


n 


1  nmc 
2m  l 


=  Htoj  cycles  per  second 


which  is  called  the  nth  harmonic  of  the  string. 

In  the  case  of  the  vibrating  string,  all  the  harmonic  frequencies  are  in¬ 
teger  multiples  of  the  fundamental  frequency  cop  Thus,  we  have  music  in 
this  case.  Of  course,  if  the  tension  in  the  string  is  not  large  enough,  then  the 
sound  produced  will  be  of  such  very  low  frequency  that  it  is  not  in  the  aud¬ 
ible  range.  As  we  increase  the  tension  in  the  string,  we  increase  the 
frequency,  and  the  result  is  a  musical  note  that  can  be  heard  by  the  human 
ear. 


Justification  of  solution.  We  cannot  prove  directly,  as  in  the  case  of  the  heat 
equation,  that  the  function  u(x,t)  defined  in  (6)  is  a  solution  of  the  wave 
equation.  Indeed,  we  cannot  even  prove  directly  that  the  infinite  series  (6) 
has  a  partial  derivative  with  respect  to  t  and  jc.  For  example,  on  formally 
computing  ut ,  we  obtain  that 


nmc  •  nmx  .  nmet 
—j-  an  sm  -j-  sm  — — 


and  due  to  the  presence  of  the  factor  n ,  this  series  may  not  converge.  How¬ 
ever,  there  is  an  alternate  way  to  establish  the  validity  of  the  solution  (6). 
At  the  same  time,  we  will  gain  additional  insight  into  the  structure  of 
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u(x,t).  Observe  first  that 

.  mx  met  If  .  m  s  .\  .  •  m  /  ,  ,\  1 

sin— y-  cos— —  =  —  sin— (x  —  c/)  +  sm— (x  +  ct)  j. 

Next,  let  F  be  the  odd  periodic  extension  of  /  on  the  interval  —  l <x<l; 
that  is, 


/(•*)» 

-/(-*). 


0< x<  l 

-l<x<  0 


and  F(x  +  2l)  =  F(x). 


It  is  easily  verified  (see  Exercise  6)  that  the  Fourier  series  for  F  is 

00  / 
cv  \  V1  •  mx  2  fl  r(  x  •  mx  , 

F(x)  =  2j  cnsm~j~>  C„=  y  J  /(x)sin  — rfx 

n  =  1  0 

Therefore,  we  can  write  u(x,t)  in  the  form 

u  (x,  t)  =  \  [  F(x  —  ct)  +  F{x  +  c/)  ] 


and  it  is  now  a  trivial  matter  to  show  that  u(x,t )  satisfies  the  wave  equa¬ 
tion  if  f(x)  has  two  continuous  derivatives. 

Equation  (7)  has  the  following  interpretation.  If  we  plot  the  graph  of  the 
function  y  =  F{x  —  ct)  for  any  fixed  /,  we  see  that  it  is  the  same  as  the 
graph  of  y  =  F(x),  except  that  it  is  displaced  a  distance  ct  in  the  positive  x 
direction,  as  shown  in  Figures  la  and  lb.  Thus,  F(x  —  ct )  is  a  wave  which 
travels  with  velocity  c  in  the  positive  x  direction.  Similarly,  F(x  +  ct)  is  a 
wave  which  travels  with  velocity  c  in  the  negative  x  direction.  The  number 
c  represents  the  velocity  with  which  a  disturbance  propagates  along  the 
string.  If  a  disturbance  occurs  at  the  point  x0,  then  it  will  be  felt  at  the 
point  x  after  a  time  t  =  (x  —  x0)/c  has  elapsed.  Thus,  the  wave  equation,  or 
some  form  of  it,  characterizes  the  propagation  of  waves  in  a  medium  where 
disturbances  (or  signals)  travel  with  a  finite,  rather  than  infinite,  velocity. 


(b)  (a) 

Figure  1 
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Exercises 


Solve  each  of  the  following  boundary-value  problems. 


1.  u„  =  c2uxx; 

2.  u„  =  c2  uxx ; 


(i/(x,  0)  =  COSX— 1,  !/,  (*,  0)  =  0,  0<X<27 t 
t/(0,/)=0,  i/(2t7,/)  =  0 

t/(x,0)  =  0,  w/(x,0)  =  l,  0<x<l 
i/(0, /)  =  0,  11(1,0*0 


3. 


u„  =  c2uxx 

1/(0, 0  =  t/(3, 0=0  * 


0<  x  <  1, 

1  <  x  <  2 

2  <  x  <  3 


l/,(x,0)  =  0 


4. 


w(x,0)  =  xcos7rx/2,  i//(x,0)  =  0,  0<x<1 

t/ (0,0  =  0,  i/(l, 0*0 


5.  A  string  of  length  10  ft  is  raised  at  the  middle  to  a  distance  of  1  ft,  and  then 
released.  Describe  the  motion  of  the  string,  assuming  that  c2=  1. 

6.  Let  F  be  the  odd  periodic  extension  of  / on  the  interval  —  l <x<l.  Show  that 
the  Fourier  series 


2 

/ 


sin 


nmx 

l 


converges  to  F(x)  if  F  is  continuous  at  x. 

7.  Show  that  the  transformation  £= x  —  ct,  tj  =  x  +  ct  reduces  the  wave  equation  to 
the  equation  =  0.  Conclude,  therefore,  that  every  solution  u(x,t)  of  the  wave 

equation  is  of  the  form  t/(x,  /)  =  F(x  —  ct)  +  G  (x  +  ct)  for  some  functions  F  and 
G. 


8.  Show  that  the  solution  of  the  boundary-value  problem 


u„  =  c2 


u 


XX  > 


w(x,0)=/(x),  t/,(x,0)  =  g(x),  -/<*</ 

1/(0,  t)  =  u(l,t)  = 0 


is 

w(x,/)  =  \  [F(x-ct)  +  F(x  +  ct)\  +  fX  +  Ctg(s)ds 
2  2c  Jx-Ct 

where  F  is  the  odd  periodic  extension  of  /. 

9.  The  wave  equation  in  two  dimensions  is  utt  =  c\uxx  +  uyy).  Find  solutions  of 
this  equation  by  the  method  of  separation  of  variables. 

10.  Solve  the  boundary-value  problem 

r  ( t/(x,0)=/(x),  ut(x, 0)  =  0,  0 <x<l 

utt=*c  uxx  +  u:  {  , 

[  i/(0, /)=0,  «(/,/)= 0 
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5.8  Laplace’s  equation 

We  consider  now  Laplace’s  equation 

9 2u  d2u  _q 

a*2  a/ 


As  we  mentioned  in  Section  5.2,  two  important  boundary-value  problems 
that  arise  in  connection  with  (1)  are  the  Dirichlet  problem  and  the  Neu¬ 
mann  problem.  In  a  Dirichlet  problem  we  seek  a  function  u(x,y)  which 
satisfies  Laplace’s  equation  inside  a  region  R,  and  which  assumes  pre¬ 
scribed  values  on  the  boundary  of  R.  In  a  Neumann  problem,  we  seek  a 
function  u(x,y)  which  satisfies  Laplace’s  equation  inside  a  region  R,  and 
whose  derivative  in  the  direction  normal  to  the  boundary  of  R  takes  on 
prescribed  values.  Both  of  these  problems  can  be  solved  by  the  method  of 
separation  of  variables  if  R  is  a  rectangle. 


Example  1.  Find  a  function  u(x,y)  which  satisfies  Laplace’s  equation  in 
the  rectangle  0 <x<a,  0 <y<b  and  which  also  satisfies  the  boundary 
conditions 


y 

b 


w(x,0)  =  0,  u(x,b)  =  0 
u(0,y)=0,  u(a,y)=f(y) 


u  =  0 

«  =/0) 

u  =  0  a 


Solution.  We  solve  this  problem  in  two  steps.  First,  we  will  find  functions 
un(x,y)  =  Xn(x)Yn(y)  which  satisfy  the  boundary-value  problem 

uxx  +  uyy=Ql  u(x,0)  =  0,  u(x,b)= 0,  m(0,>’)=0.  (3) 

Then,  we  will  find  constants  cn  such  that  the  linear  combination 

00 

u{x,y)=  2  cnu„(x,y) 

n=  1 

satisfies  the  boundary  condition  u(a,y)=f(y). 

Step  1:  Let  u(x,y)  =  X(x)Y(y).  Computing  uxx  =  X"Y  and  uyy  =  XY'\  we 
see  that  u{x,y)  —  X(x)Y(y)  is  a  solution  of  Laplace’s  equation  if  X"Y+ 
XY”  =  0,  or 


Next,  we  observe  that  the  left-hand  side  of  (4)  is  a  function  of  y  alone, 
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while  the  right-hand  side  is  a  function  of  x  alone.  This  implies  that 

V  "  V  " 


for  some  constant  A.  In  addition,  the  boundary  conditions 

0  =  u  (x,  0)  =  X  (  jc)  Y  (0),  0  =u(x,b)  =  X(x)Y(b), 

0=u(0,y)  =  X(0)Y(y) 

imply  that  T(0)  =  0,  Y(b)  =  0,  and  Ar(0)  =  0.  Hence  u(x9y)  =  XY  is  a  solu¬ 
tion  of  (3)  if 

T"+AT=0;  T(0)  =  0,  Y(b)  =  0  (5) 

and 

X"-\X=0,  X  (0)  =  0.  (6) 

At  this  point  the  constant  X  is  arbitrary.  However,  the  boundary-value 
problem  (5)  has  a  nontrivial  solution  Y(y)  only  if  \=\n  =  n2*rr2/b2,  and  in 
this  case, 

Y ( y )  =  Yn  O)  =  sinmry/b. 

Equation  (6),  in  turn,  implies  that  Xn(x)  is  proportional  to  sinhwrx:/6. 
(The  differential  equation  X”  —  (n2^2/ b2)X  =  0  implies  that  A"(x)  = 
c , cosh mrx / b  +  c2sinhmrx / b  for  some  choice  of  constants  cl9c2,  and  the 
initial  condition  X  (0)  =  0  forces  cx  to  be  zero.)  We  conclude,  therefore,  that 


M«(^)  =  sinh 


WTX 

b 


.  nmy 
Sin— r— 


is  a  solution  of  (3)  for  every  positive  integer  n. 
Step  2:  The  function 


/  x  V'  .  u  mx  .  my 
u(x,y)=  2j  c„sinh— sin-r- 

n  —  \ 

is  a  solution  (formally)  of  (3)  for  every  choice  of  constants  cl9c29... .  Its 
value  at  x  =  a  is 

00 

/  x  V'  •  u  mta  •  my 
u{a,y)  =  2a  ^sinh  — sin— . 

n=\ 

Therefore,  we  must  choose  the  constants  cn  such  that 

00 

/(>')=  2  cnsinh^~sin“y'>  0 <y<b. 

n=\ 

In  other  words,  we  must  expand  /  in  a  Fourier  sine  series  on  the  interval 
0  <y<b.  This  is  precisely  the  situation  described  in  Theorem  3  of  Section 
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5.5,  and  we  conclude,  therefore,  that 


b  sinh 


ma 


f  f(y)  si 

J0 


sin  —7—dy> 


n  —  1,2, .. . . 


Remark.  The  method  of  separation  of  variables  can  always  be  used  to 
solve  the  Dirichlet  problem  for  a  rectangle  R  if  u  is  zero  on  three  sides  of 
R .  We  can  solve  an  arbitrary  Dirichlet  problem  for  a  rectangle  R  by  split¬ 
ting  it  up  into  four  problems  where  u  is  zero  on  three  sides  of  R  (see  Ex¬ 
ercises  1-4). 


Example  2.  Find  a  function  u(x,y )  which  satisfies  Laplace’s  equation  in 
the  rectangle  0<x<a ,  0 <y<b,  and  which  also  satisfies  the  boundary 
conditions 


y 

b 


uy(x,0)=0,  uy{x,b)=0 

« x(a,y)=f(y ) 

Uy=  0 

W*  =/0) 

- 1  - . -»  x 

uy  —  0  a 


Solution .  We  attempt  to  solve  this  problem  in  two  steps.  First,  we  will  find 
functions  un{x,y)  —  Xn{x)Yn{y)  which  satisfy  the  boundary-value  problem 

uxx  +  uyy=Q'’  Uy(x,0)=0,  uy(x,b)= 0,  and  ux(0,.y)  =  0.  (8) 

Then,  we  will  try  and  find  constants  cn  such  that  the  linear  combination 
u(x,y)  =  2  ^=ocnu„(x,y)  satisfies  the  boundary  condition  ux(a,y)  =  f{y). 
Step  1:  Set  u(x,y)  =  X (x)  Y (y).  Then,  as  in  Example  1, 


x 


X 


for  some  constant  X.  The  boundary  conditions 


0  =  (x, 0)  =  X ( x)  r (0),  0  =uy(x,b)  =  X(x)Y'(b), 
0  =  ux(0,y)  =  X'(0)Y(y) 

imply  that 

r(0)  =  0,  Y'(b)  =  0  and  X'(0)=0. 


Hence,  u(x,y)  =  X (x)Y (y)  is  a  solution  of  (8)  if 

T"+XT=0;  T'(0)=0,  y'(6)=o 
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and 

X"-\X  =  0;  X'(0)  =  0.  (10) 

At  this  point,  the  constant  A  is  arbitrary.  However,  the  boundary-value 
problem  (9)  has  a  nontrivial  solution  Y(y)  only  if  \  =  \  =  n2fir2/b2,n  = 
0,1,2,...,  and  in  this  case 


7(y)=  Yn(y)  =  cosmry/b. 


Equation  (10),  in  turn,  implies  that  X(x)  is  proportional  to  coshmx/b. 
(The  differential  equation  X "  —  n2m2X/b2  =  0  implies  that  X{x)=* 
c j  cosh  mx / b  -1-  c2sinhw7\x: / b  for  some  choice  of  constants  cl9c2,  and  the 
boundary  condition  A"'(0)  =  0  forces  c2  to  be  zero.)  We  conclude,  therefore, 
that 


un(x,y)  =  cosh 


mx 

b 


cos 


my 

~~V 


is  a  solution  of  (8)  for  every  nonnegative  integer  n. 

Step  2:  The  function 

oo 

(  \  c0  ,  V  X.rmx  my 

u(x>y)=  y  +  2jc^cosh~b~cos~b~ 

n=  1 


is  a  solution  (formally)  of  (8)  for  every  choice  of  constants  c0,cuc2,.-. .  The 
value  of  ux  at  jc  =  a  is 


00 

uxia’y)=  2 

n=  1 


nm  .  u 
—  C-Sinh 

b 


ma 


cos 


my 

T* 


Therefore,  we  must  choose  the  constants  cvc2, ...,  such  that 

00 

0  <y<b.  (11) 

n=  1 

Now,  Theorem  3  of  Section  5.5  states  that  we  can  expand  f(y)  in  the 
cosine  series 


n  =  1  L  u 


■*  „  x  ”*y  . 
f(.y)  cos  r  dy 


COS 


my 


(12) 


on  the  interval  0<  y  <  b.  However,  we  cannot  equate  coefficients  in  (11) 
and  (12)  since  the  series  (11)  has  no  constant  term.  Therefore,  the  condi¬ 
tion 


f  f(y)dy=0 
J0 

is  necessary  for  this  Neumann  problem  to  have  a  solution.  If  this  is  the 
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case,  then 

2  fb  x  nmy 

c» - .  .ma  I  f{y)^s-j-dy,  n>  1. 

wrsinh— 7—  •'o  ° 

b 

Finally,  note  that  c0  remains  arbitrary,  and  thus  the  solution  u{x9y)  is  only 
determined  up  to  an  additive  constant.  This  is  a  property  of  all  Neumann 
problems. 

Exercises 

Solve  each  of  the  following  Dirichlet  problems. 

uxx  +  Uyy  =  0  w(.x,0)  =  0,  u(x9b)  =  0 

0<x<a9  0  <y<b'  u(a9y)  =  0,  u(09y)=f(y) 

2  uxx  +  HKyms0  .  «(0,y)=0,  u(a,y)= 0 

0  <x<a9  0  <y<b’  u(x9  0)  =  0,  u(x9b)=f(x) 

Remark.  You  can  do  this  problem  the  long  way,  by  separation  of  vari¬ 
ables,  or  you  can  try  something  smart,  like  interchanging  x  with  y  and 
using  the  result  of  Example  1  in  the  text. 

3  «xx  +  ^  =  0  w(0,y)=0,  u(aty)  =  0 

0<x<a,  0  <y<b’  u(x9b)  =  09  u(x90)=f(x) 

^  Kxx  Uyy  ^  t  u(Xyb')  — 

0<x<a9  0  <y<b’  u(0,y)  =  h(y),  u(a9y)  =  k(y) 

Hint :  Write  u(x9y )  as  the  sum  of  4  functions,  each  of  which  is  zero  on  three 
sides  of  the  rectangle. 

uxx  +  Uyy  =  0  w(x,0)  =  0,  u(x9b)  =  1 

0  <x<a9  0  <y<b’  w(0,y)  =  0,  u(a9y)=  1 

^  ^xx  4”  Myy  “  ^  #  m(.X,  0)—l 

0<x<a,  0<y<&’  ^(OjjO^O,  u(a9y)=  1 

7  Wxx  +  Wkv  =  0  .  m(>,0)=1,  w(x,6)  =  1 

0<*<a,  0 <y<b’  «(0,y)=0,  u(a9y)=  1 

g  “xx  +  ^^O  .  «(x,0)=l,  w(x,h)  =  l 

0<x<a,  0  <y<b’  u(09y)  =  \9  u(a9y)=  1 

Remark.  Think\ 

9.  Solve  the  boundary-value  problem 

^xx^Kyy  ^  m(x,0)!=0,  m(x,  1)  =  0 

0<x<1,  0<y<l’  w(0,y)  =  0,  w(l,y)=y 
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10.  (a)  For  which  functions  f(y)  can  we  find  a  solution  u(x,y)  of  the  Neumann 

problem 

Uxx+“yy=Q  .  Wx(l,.y)=0,  “/(jO 

0<x<  1,  0<y<l’  Uy(x,  0)=0,  Uy(x,  1)=0 

(b)  Solve  this  problem  if  f(y)*=sm2iry. 

11.  Laplace’s  equation  in  three  dimensions  is 

tlXx  "1“  Uyy  "F  Uzz  0. 

Assuming  that  u  =  X(x)Y(y)Z(z),  find  3  ordinary  differential  equations  satis¬ 
fied  by  Xy  Y,  and  Z. 
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6.1  Introduction 

In  Section  5.5  we  described  the  remarkable  result  that  an  arbitrary  piecewise 
differentiable  function  f(x )  could  be  expanded  in  either  a  pure  sine  series  of 
the  form 


00 


/(*)  =  Z  bn  sin 


n= 1 


mix 

~T 


or  a  pure  cosine  series  of  the  form 

cin  S  mix 

f(x)  =  T  +  X  cos  —  (2) 

on  the  interval  0  <  x  <  /.  We  were  led  to  the  trigonometric  functions  ap¬ 
pearing  in  the  series  (1)  and  (2)  by  considering  the  2  point  boundary  value 
problems 

/  +  Ay  =  0,  y( 0)  =  0,  y(l)  =  0,  (3) 

and 

/'  +  Ay  =  0,  y'(0)  =  0,  y'(/)  =  0.  (4) 

Recall  that  Equations  (3)  and  (4)  have  nontrivial  solutions 


.  x  .  mix  mix 

yn(x)  =  c  sin  —  and  yn(x)  =  c  cos  — , 

respectively,  only  if  A  =  A„  =  .  These  special  values  of  A  were  called 

eigenvalues,  and  the  corresponding  solutions  were  called  eigenfunctions. 
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There  was  another  instance  in  our  study  of  differential  equations  where 
something  interesting  happened  for  special  values  of  a  parameter  L  To  wit, 
in  Section  2.8  we  studied,  either  in  the  text  or  in  the  exercises,  the  four 
differential  equations 

y”  —  2  xy  +  Xy  =  0,  (5) 

(1  —  x2)y"  —  2  xy'  +  X(X  +  l)y  =  0,  (6) 

(1  -  x2)f  -  xy1  +  A2y  =  0,  (7) 

and 

xy"  +  (1  —  x)y'  +  Xy  =  0.  (8) 

Equations  (5)— (8)  are  the  famous  Hermite,  Legendre,  Tchebycheff,  and 
Laguerre  differential  equations,  respectively.  The  Legendre,  Tchebycheff,  and 
Laguerre  equations  each  have  a  polynomial  solution  of  degree  n  if  X  =  n, 
while  the  Hermite  equation  has  a  polynomial  solution  of  degree  n  if  X  =  In. 
These  polynomials,  when  properly  normalized,  i.e.,  when  multiplied  by  a 
suitable  constant,  are  known  as  the  Hermite,  Legendre,  Tchebycheff,  and 
Laguerre  polynomials.  It  turns  out,  remarkably,  that  any  piecewise  differ¬ 
entiable  function  f(x )  can  also  be  expanded  in  a  series  of  Hermite,  Legendre, 
Tchebycheff,  and  Laguerre  polynomials,  on  an  appropriate  interval. 

There  is  a  very  pretty  theory  that  ties  together  not  only  the  trigonometric 
functions  and  the  Hermite,  Legendre,  Tchebycheff,  and  Laguerre  poly¬ 
nomials,  but  also  many  of  the  other  famous  functions  of  mathematical 
physics,  such  as  the  various  Bessel  functions.  This  theory  is  commonly  called 
Sturm-Liouville  Theory;  it  has  its  roots,  essentially,  in  an  area  of  linear 
algebra  known  as  inner  product  spaces,  and  it  is  to  this  area  that  we  now 
turn  our  attention. 


6.2  Inner  product  spaces 

Up  to  this  point,  our  study  of  linear  algebra  in  general  and  linear  vector 
spaces  in  particular  was  algebraic  in  nature.  We  were  able  to  add  two  vectors 
together  and  multiply  a  vector  by  a  constant.  By  means  of  these  operations 
we  can  define  the  geometric  concepts  of  dimension,  line,  plane,  and  even 
parallelism  of  lines.  Recall  that  the  dimension  of  a  space  V  is  the  number  of 
elements  in  a  basis,  i.e.,  the  fewest  number  of  linearly  independent  vectors  that 
span  V.  Once  we  have  the  concept  of  dimension,  we  can  define  a  line  in  V  as 
a  subspace  of  dimension  1,  a  plane  as  a  subspace  of  dimension  2,  etc.  Finally, 
two  vectors  are  parallel  if  one  is  a  constant  multiple  of  the  other. 

Many  important  geometric  concepts  of  so-called  Euclidean  geometry, 
however,  still  cannot  be  defined  for  arbitrary  linear  vector  spaces.  Specifically, 
we  have  no  way  of  formulating,  as  yet,  the  definition  of  length  of  a  vector 
and  the  angle  between  two  vectors.  To  accomplish  this,  we  need  to  super- 
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impose  on  V  some  additional  structure.  This  structure  is  known  as  an  inner 
product. 

Our  definition  of  an  inner  product  is  modelled  after  the  traditional  dot 
product  of  two  vectors  in  R2  and  R 3,  which  we  studied  in  calculus.  Recall  that 
if  x  and  y  are  vectors  in  R2  or  R3,  then 

xy  =  +  x2y2  in  R2  (1) 

and 

xy  =  *i)>i  +  x2y2  +  x3y3  in  R3  (2) 

where  xl9  x2, . . . ,  and  yu  y2, . . .  are  the  components  of  x  and  y,  respectively. 
Recall  too,  the  famous  identity,  proven  in  most  calculus  courses,  that 

x  y  =  |x||y|cos0  (3) 

where  9  is  the  angle  between  x  and  y  and 

|x|  =  (; x\  +  x2)1/2  or  (x\  +  x\  +  x2)1/2 

in  R2  and  R3,  respectively,  is  the  Euclidean  length  of  x.  Rewriting  Equation 
(3)  in  the  form 


COS0  =  -^- 

|x||y| 


enables  us  to  compute  the  angle  6  between  two  vectors.  Finally,  if  we  observe 
that 


|x|  =  (x-x)1/2  (5) 

then  both  the  length  of  a  vector  and  the  angle  between  two  vectors  can  be 
computed  from  the  dot  product  alone. 

We  are  now  ready  to  generalize  this  concept  of  dot  product.  This  general¬ 
ization  will  be  called  an  inner  product.  We  will  first  define  a  real  inner  product, 
and  then  toward  the  end  of  this  section,  a  complex  inner  product. 

Definition.  Let  V  be  a  real  vector  space.  A  real  inner  product  on  V  is  a 
real-valued  function  that  associates  with  each  pair  of  vectors  x,  y  a  real 
number,  denoted  by  <x,  y>,  that  satisfies  the  following  properties: 

(i)  <x,  y>  =  <y,  x>  for  all  x  and  y  in  V, 

(ii)  <kx,  y)  =  k<x,y>  for  all  scalars  k  and  vectors  x,  y, 

(iii)  <x  +  y,z>  =  <x,z>  +  <y,z>  for  all  x,  y,  z  in  V9 

(iv)  <x,  x)  ^  0  and  <x,  x)  =  0  only  if  x  =  0. 

It  is  customary  to  refer  to  the  vector  space  V,  together  with  some  inner 
product  <  ,  )  as  a  real  inner  product  space,  and  a  finite-dimensional  real 
inner  product  space  is  often  called  a  Euclidean  space. 
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Remark.  Observe  from  (i)  that  properties  (ii)  and  (iii)  hold  equally  well  on 
opposite  sides  of  the  inner  product  bracket,  i.e., 

<x,  ky>  =  fc  <x,  y>  and  <x,  y  +  z>  =  <x,  y>  +  <x,  z>. 

The  classic  example  of  an  inner  product  on  Rn ,  of  course,  is  the  dot  product 

<x,y>  =  x  y  =  x1y1  +  x2y2  +  •••  +  xnyn. 

It  is  easily  verified  that  the  dot  product  satisfies  properties  (i)-(iv).  Here 
are  some  additional  examples,  which  have  proven  extremely  useful  in 
applications. 


Example  1.  Let  V  be  the  space  of  all  continuous  functions  on  the  interval 
[a,  b],  and  define  the  inner  product  of  two  functions  f(x)  and  g(x)  in  V  as 


<f,9>  = 


mb 

f(x)g(x)dx. 

a 


It  is  easily  verified  that  the  definition  (6)  satisfies  properties  (i)-(iv). 


Example  2.  Let  V  be  as  in  Example  1,  and  define 

<f,9>  =  f  r(x)f(x)g(x)  dx 


(7) 


where  r(x)  is  positive  on  the  open  interval  (a,  b).  Again,  it  is  trivial  to  verify 
that  the  inner  product  (7)  satisfies  properties  (i)-(iv). 


Example  3.  Let  V  =  R3  and  set 


<x,y>  =  xxyx  +  2 x2y2  +  2x3y3  -  xxy2  -  x2y1  -  x2y3  -  x3y2. 


Properties  (ii)  and  (iii)  are  trivial  to  check,  and  property  (i)  is  simple  to  verify. 
To  check  property  (iv),  we  write 


<X,X>  =  x\  +  2x2  +  2X3  —  xxx2  —  x2x1  —  x2x3  —  x3x2 
=  x\  4-  2x\  +  2x3  —  2xxx2  —  2x2x3 
=  x\  —  2xxx2  +  x\  +  x\  —  2x2x3  +  x|  +  x3 

=  (^1  -  *2  f  +  (X2  ~  ^s)2  +  Xl, 


and  it  is  now  clear  that  <x,  x)  =  0  if  and  only  if  x  =  0. 

Let  x  and  y  be  vectors  in  an  arbitrary  inner  product  space  V,  and  suppose, 
following  Equation  (5),  that  we  define  the  lengths  of  x  and  y,  denoted  by  ||x|| 
and  ||y||,  respectively,  as 

11*11  =  <x,x>1/2,  llyll  =  <y,y>1/2,  (8) 

where  the  double  vertical  lines  are  used  to  denote  length.  It  is  certainly  very 
plausible  to  try  to  mimic  Equation  (4)  and  define  the  angle  6  between  x  and 
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y  via  the  equation 


cos  9  = 


<*,y) 
11*11  llyll 


<x,y> 

<x,x>1/2<y,y>1/2' 


Clearly,  the  right-hand  side  of  Equation  (9)  must  be  ^  1  in  order  for  this 
equation  to  have  any  meaning  at  all.  A  second  and  more  subtle  requirement 
is  that  the  quantity  ||x||,  defined  as  the  length  of  x,  really  is  a  length;  that  is,  it 
satisfies  the  geometric  properties  usually  associated  with  length.  Let  us  there¬ 
fore  take  one  last  digression  and  discuss  the  concept  of  length  of  a  vector,  or 
as  it  is  traditionally  called  in  linear  algebra,  the  norm  of  a  vector. 


Definition.  A  norm  on  a  real  or  complex  vector  space  V  is  a  real-valued 
function,  usually  denoted  by  ||  ||,  which  satisfies 

(i)  ||x||  ^  0  for  all  x  in  V  and  ||x||  =  0  only  if  x  =  0, 

(ii)  ||kx||  =  \k\  ||x||  for  all  scalars  k , 

(iii)  ||x  +  yK||x||  +  ||y||. 


Properties  (i)  and  (ii)  are  fairly  obvious  properties  of  length,  while  property 

(iii)  is  simply  the  triangle  law,  which  states  that  the  length  of  one  side  of  a 
triangle  is  always  less  than  (or  equal  to)  the  sum  of  the  lengths  of  the  other 
two  sides  (see  Figure  1). 

The  classical  example  of  a  norm  is  ordinary  length  in  Rn ;  i.e.,  if 


r 


X  = 


X 


n 


j 


then 


Nl  =  (x?  +  X  !  +  •"  +  *n)1/2 

obviously  satisfies  (i)— (iii).  Here  are  some  additional  examples,  which  have 
proven  quite  useful  in  applications. 


Figure  1 


518 


6.2  Inner  product  spaces 


Example  4.  Let  V  —  Rn  and  define 

Mi  =  l*il  +  \x2\  +  ■■■  +  l*J- 

It  is  extremely  easy  to  verify  that  ||  ||  1  is  a  norm  on  Rn. 

Example  5.  Let  V  =  Rn  and  define 

ML  =  max[|x1|,|x2|,...,|xj]. 

Again,  it  is  extremely  simple  to  verify  that  ||  defines  a  norm  on  Rn. 

Example  6.  Let  V  =  Rn  and  define 

iixiiP  =  [ix1r  +  ix2r  +  ---  +  ixwr]1^.  (io) 

It  is  extremely  simple  to  verify  that  ||  ||p,  referred  to  as  the  p  norm,  satisfies 
properties  (i)  and  (ii).  Property  (iii)  is  also  true  but  quite  difficult  to  verify, 
hence  we  will  not  do  so  here.  We  wish  to  point  out  (see  Exercise  6),  however, 
that 


II x || qq  =  lim  || x ||p, 

p~*  00 

and  this  is  the  motivation  for  the  strange  notation  || 


Example  7.  Let  V  be  the  space  of  all  continuous  functions  /(x)  on  the  interval 
[0, 1],  and  define 


ri 


o 


I /Ml  dx. 


It  is  easily  verified  that  Equation  (11)  defines  a  norm  on  V. 


Let  us  return  now  to  the  questions  raised  via  the  definitions  (8)  and  (9).  As 
mentioned  previously,  we  must  show  that  the  right-hand  side  of  Equation  (9) 
is  always  between  —  1  and  + 1  in  order  for  this  equation  to  make  sense.  But 
this  is  equivalent  to  the  inequality 

l<x>y)l  ^  <x,x>1/2<y,y>1/2 

or,  equivalently, 

I <x, y> |  ^  || x ||  ||y||.  (12) 

Equation  (12)  is  the  precise  statement  of  a  very  famous  theorem,  known  as 
Schwarz’s  Inequality,  which  we  now  prove. 

Theorem  1  (Schwarz’s  Inequality).  Let  V  be  a  real  inner  product  space  with 
inner  product  <  ,  ).  Then 


l<x,y>|<||x||  ||y|| 
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for  all  vectors  x,  y,  where  ||x||  is  defined  by  Equation  (8).  Furthermore , 
equality  holds  in  Equation  (13)  only  if  the  vectors  x  and  y  are  linearly 
dependent ,  i.e.,  y  =  kx  for  some  scalar  k. 

Proof.  There  are  several  ways  of  proving  Theorem  1,  but  we  will  choose  here 
the  proof  that  generalizes  most  easily  to  the  complex  case.  Observe  that 


<x,y>  <x,y> 

x  -  - - -y,  x  -  — - -y  )  >  0 


<y,y> 


<y,y> 


for  all  nonzero  vectors  x  and  y  in  V.  This  follows  immediately  from  property 
(iv)  of  inner  products.  Next,  using  properties  (i)-(iii)  of  inner  products,  we  can 
rewrite  Equation  (14)  in  the  form 


<X,X> 


<x,y) 

<y,y> 


<y,x> 


<x,y> 

<y,y> 


<x,y>  + 


<x,y> 

<y,y>2 


Hence, 


=  <x,x> 


2<x,y>2  |  <x,y>2 

<y>  y>  <y ,  y> 


=  <x,x> 


<x,y>2 

<y,y> 


<x,y>2 

llyll2  ' 


2 


or,  equivalently, 

I <x, y> |  || x ||  ||y||. 

Finally,  again  from  property  (iv),  equality  holds  in  Equation  (14)  only  if 

<x,y> 


<y,y> 


y  =  o 


or 


X  =  fey. 


<x,y) 

<y,  y>  ’ 


Remark  1.  The  proof  we  just  gave  assumes  that  y  ^0.  We  leave  it  to  the 
reader  to  verify  (see  Exercise  13)  Schwarz’s  Inequality  in  the  trivial  case  that 

y  =  o. 


Remark  2.  The  vector  (<x,y)/<y,y))y  introduced  in  Equation  (14)  was  not 
a  luckly  guess.  This  vector  is  actually  the  projection  of  the  vector  x  onto  the 
vector  y,  as  shown  in  Figure  2. 
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Figure  2 


The  second  question  raised  was  whether 

11*11  =  <x,x>1/2  (15) 

satisfies  properties  (i)-(iii)  of  norms.  Property  (i)  follows  immediately  from 
property  (iv)  of  inner  products.  To  verify  property  (ii),  we  compute 

|| /cx ||  =  <kx,kx>1/2  =  [k2<x,x>]1/2  =  \k\  <x,  x>1/2  =  \k\  || x || . 
Property  (iii)  is  a  bit  more  difficult  to  verify.  Observe  first  that 

l|x  +  y||2  =  <x  +  y,x  +  y> 

=  <x,  x>  +  <x,  y>  +  <y,  x>  +  <y,  y> 

=  l|x|| 2  +  2<x,y>  +  ||y||2 
^  l|x||2  +  2||x||  ||y||  +  ||y||2 
=  [||x||  +  llyll]2, 

where  the  inequality  above  follows  directly  from  Schwarz’s  Inequality.  Tak¬ 
ing  square  roots  gives  the  triangle  inequality,  and  this  completes  our  proof. 


Remark.  Schwarz’s  Inequality  is  an  extremely  powerful  tool,  as  witnessed  by 
the  preceding  proof.  Here  is  another  illustration  of  its  strength.  Let  V  be  the 
space  of  all  continuous  functions  on  the  interval  [a,  b],  and  define 


rb 


</,0>  = 


J  a 


f(x)g(x)  dx. 


Then,  from  Schwarz’s  Inequality, 


\<f,g>  1  = 


f  f(x)g(x)dx 


Ja 


ll/MII  Il0(x)ll. 


Hence, 


*b 

*b 

1/2 

*b 

f(x)g(x)  dx 

a 

< 

f2(x)  dx 

a  _ 

% 

g2(x)  dx 

a  _ 

We  defy  the  reader  to  give  a  pure  calculus  proof  of  the  inequality  (16). 


521 


6  Sturm-Liouville  boundary  value  problems 


Once  we  have  an  inner  product  space  V,  we  can  define  two  vectors  in  V  to 
be  orthogonal ,  or  perpendicular  to  each  other,  if  their  inner  product  is  zero; 
i.e.,  x  and  y  are  orthogonal  if  <x,  y)  =  0. 


Example  8.  Let  V  be  the  space  of  all  continuous  functions  on  the  interval 
[  — 7i,7i],  and  define  an  inner  product  on  V  via  the  relation 


The  set  of  functions 


</>£>  = 


fit 

f(x)g(x)dx. 


J  —it 


fk(x)  =  sin  kx ,  k  =  1,2,..., 


are  all  mutually  orthogonal;  that  is, 


<fj{x),fk{x)y  =  o, 


j  ¥=  k. 


since 


<fj(x),fk(x)}  = 


pit 


sin  jx  sin  kx  dx  =  0,  j  /  fc, 


—it 


following  the  discussion  in  Section  5.4. 


Example  9.  Let  V  be  the  space  of  all  continuous  functions  on  the  interval 
[— 1, 1],  and  define 


</,£>  = 


ri 


f(x)g{x) 


dx 


x ‘ 


for  /  and  g  in  V.  The  set  of  functions 


fk(x)  =  cos(fccos-1  x),  k  =  0,1,2,..., 
are  mutually  orthogonal  with  respect  to  <  ,  ),  i.e., 

f1  cos(y  cos-1  x)cos(fecos_1x) 

-l 


dx  =  0,  j  /  k. 


x 4 


To  establish  Equation  (17)  we  make  the  substitution 

-1 


u  =  cos  lx 


du  — 


Then  the  left-hand  side  of  Equation  (17)  becomes 

rO 


dx. 


t*  n 


cos  ju cos  kudu  =  cos  ju  cos  ku  du 

Jn  JO 


J  rn 

2 


(17) 


[cos(  j  +  k)u  +  cos  (j  —  k)u ]  du  =  0,  j  ^  k. 


o 
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Remark.  It  can  be  shown  that  the  functions  fk(x)  defined  in  Example  9  are 
the  Tchebycheff  polynomials  Tk(x). 

We  conclude  this  section  by  extending  the  notion  and  definition  of  inner 
product  to  complex  vector  spaces.  The  main  problem  we  have  to  overcome 
is  the  requirement  that  <x,x>  be  positive  for  all  nonzero  vectors  x.  For 
example,  if  V  is  the  space  of  all  complex- valued  functions 

h(t)  =  f(t)  +  ig(t\  a  ^  t  ^  h, 

then  we  cannot  define 


<huh2y 


rb 


Ja 


hi(t)h2{t)  dt 


as  in  Example  1,  since 


<M> 


a 


is  not  even  real  [for  </,  g )  /  0],  let  alone  positive. 

To  motivate  the  proper  extension,  we  return  to  the  familiar  dot  product 


<x,y>  =  x  y  =  xty2  +  x2y2  +  •••  +  xnyn  (18) 

where  x  and  y  are  real  vectors,  i.e.,  their  components  are  real  numbers.  We 
can  extend  this  definition  to  complex  vector  spaces  by  making  the  simple 
adjustment 

X  y  =  Xty2  +  x2y2  +  ■■■  +  x„y„.  (19) 

Then,  following  Equation  (19), 


<x,x>  =  xxxx  +  X2X2  +  "•  +  xnxn 

is  always  real  and  nonnegative.  The  only  thing  we  have  to  be  extra  careful 
about  now  is  that 


y-x  =  yxxx  +  •••  +  y„xn  =  <x,y> 

instead  of  <x,  y)  and 

x-fcy  =  xlkyl  +  •••  +  xnkyn  =  k<x,y> 
instead  of  k  <x,y>.  This  leads  us  to  the  following  definition. 

Definition.  Let  V  be  a  complex  vector  space.  A  complex ,  or  Hermitian,  inner 
product  on  V  is  a  complex-valued  function  that  associates  with  each  pair 
of  vectors  x,  y  a  complex  number,  denoted  by  <x,y>,  that  satisfies  the 
following  properties: 


(i)  <x,y>  =  <y,x>, 

(ii)  <kx,y>  =  k<x,y>, 
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(iii)  <x  +  y,  z>  =  <x,  z>  +  <y,  z>, 

(iv)  <x,  x>  ^  0  and  <x,  x)  =  0  only  if  x  =  0 

for  all  vectors  x  and  y  and  complex  numbers  k. 

Property  (i)  is  known  as  conjugate  symmetry  in  contrast  to  the  ordinary 
symmetry  of  the  real  case.  It  implies  that 

<x,/cy>  =  <ky,x>  =  k<  y,x>  =  fc<  y,x>  =  fc<x,  y>. 

Thus,  scalars  pulled  out  from  the  right-hand  side  of  a  complex  inner  product 
must  be  conjugated.  Finally,  we  leave  it  to  the  reader  to  verify  (see  Exercise 
14)  that 

<x,y  +  z>  =  <x,y>  +  <x,z>. 

Example  10.  Let  V  be  the  space  of  all  complex- valued  functions  h(t)  =  f(t)  + 
ig(t)  on  the  interval  [0, 1],  and  define 

(hl9h2y  =  f 

Jo 

=  [/i(0  +  igi  (*)]  [/2(  0  -  ig  2  (01  *  (20) 

Jo 

=  f  [/1W/2W  +  ffiWffzW]  ^  +  i  f  -  fi(t)g2(t)~\  dt 

Jo  Jo 

It  is  easily  verified  that  Equation  (20)  satisfies  properties  (i)— (iv)  of  our 
definition. 

Complex  inner  product  spaces  are  the  same  as  real  inner  product  spaces, 
essentially,  except  for  the  conjugate  symmetry  property,  as  noted  above.  For 
example,  we  define 

||x||  =<x,x>^2  (21) 

to  be  the  length  of  a  vector  x  in  a  complex  vector  space,  just  as  in  a  real  vec¬ 
tor  space.  The  only  thing  that  we  must  do,  though,  is  extend  the  proof  of 
Schwarz’s  Inequality  to  the  complex  case,  since  our  proof  made  use  of  the 
symmetry  property  of  real  inner  products. 

Theorem  2  (Schwarz’s  Inequality).  Let  V  be  a  complex  inner  product  space 
with  inner  product  <  ,  ).  Then 

|<x,y>|  <  ||x||  ||y||  (22) 

for  all  x,  y  in  V \  where  ||x||  is  defined  by  Equation  (21).  Furthermore ,  equality 
holds  in  Equation  (22)  if  and  only  if  the  vectors  x  and  y  are  linearly  depen¬ 
dent. 
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Proof.  Observe  from  property  (iv)  that 


<x,y) 

<y,y> 


y)^o 


for  all  vectors  x  and  nonzero  vectors  y  in  V.  Using  properties  (i)-(iii),  we  can 
rewrite  the  left-hand  side  of  Equation  (23)  in  the  form 


,  v  <x,y>.  v  <x,y>,  <x,y><x,y> 

<x,x>  -- — -< y,x>  -- — r-<x,y>  +  - — -- — ;-<y,y> 

<y,y>  <y»y>  <y,y>  <y,y> 

<x,y> 


=  <x,  x> 


x  2- 


<y,y> 

l<x,y>|2 


<x,y> 


<x,y>  <x,  y>  7 - ^ 

<x,  y>  +  - - -  <x,  y> 


<y»y> 


<y,y> 


Hence, 


l|x||2||y||2-|<x,y>|2 

llyll2 

|<x,y>|2^  ||x||2||y||2, 


and  Schwarz’s  Inequality  follows  immediately  by  taking  square  roots  in 
Equation  (24). 


Remark.  As  in  the  real  case,  our  proof  assumes  that  y  #  0.  We  leave  it  to  the 
reader  (see  Exercise  13)  to  complete  the  proof  for  y  =  0  and  to  verify  that 
equality  holds  if  and  only  if  x  and  y  are  linearly  dependent. 


Exercises 

1.  Show  that  Equation  (6)  defines  an  inner  product  on  C[a,b],  the  space  of  all 
continuous  functions  on  the  interval  [a,  ft], 

2.  Show  that  Equation  (7)  defines  an  inner  product  on  C[a,b]. 

3.  Show  that  ||x|| x  defined  in  Example  4  is  a  norm. 

4.  Show  that  Hxll^  defined  in  Example  5  is  a  norm. 

5.  Show  that  ||x||p  defined  in  Example  6  is  a  norm. 

6.  Show  that  Hxll^  =  lim  ||x||p. 

p->cc 

7.  Show  that  ||x||  =  |  -I-  2|x2|  +  4|x3|  is  a  norm  on  R3. 

8.  Show  that  |xx|  +  |x2|  +  |x3|2  does  not  define  a  norm  on  R3. 

9.  Show  that  e|Xil+l*21  does  not  define  a  norm  on  R2. 

10.  Show  that  Equation  (11)  defines  a  norm  on  C[a,fr]. 
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11.  Let  V  be  a  real  inner  product  space.  Prove  that 

<x.y>  =  ^llx  +  yll2-^llx-yll2- 

12.  Let  x  x  y  be  the  vector  with  components 

^2^3  -  X3y2,  ^3^1  -  Xty3,  Xty2  -  X2yl9 

respectively.  Show  that  x  x  y  is  perpendicular  to  both  x  and  y  under  the  inner 
product  (x,  y). 

13.  Complete  the  proof  of  Schwarz’s  Inequality  in  both  Rn  and  Cn  for  the  case  y  =  0. 

14.  Show  that 


<x,  y  +  z>  =  <x,  y>  4-  <x,  z> 

for  all  vectors  x,  y,  z  in  Cn. 


6.3  Orthogonal  bases,  Hermitian  operators 

It  is  often  the  case  that  a  crucial  step  in  the  solution  of  many  problems  in  a 
linear  vector  space  revolves  around  a  judicious  choice  of  basis.  A  type  of  basis 
that  is  especially  useful  is  one  that  is  orthogonal 

Definition.  A  set  of  vectors  is  said  to  be  orthogonal  if  the  inner  product  of  any 
two  distinct  vectors  in  the  set  is  zero. 

One  of  the  nice  things  about  an  orthogonal  set  of  vectors  is  that  they  are 
automatically  linearly  independent.  This  is  the  content  of  Lemma  1. 

Lemma  1.  Let  x1?  x2, . . . ,  xN  be  mutually  orthogonal ,  that  is, 

(xhXj}  =  0,  i±j. 

Then ,  xx ,  x2, . . . ,  xN  are  linearly  independent. 

Proof.  Suppose  that 

CiXi  +  c2x2  +  •  •  •  +  CNXN  =  0.  (1) 

Taking  inner  products  of  both  sides  of  Equation  (1)  with  Xj  gives 

Ci<xuXj}  +  C2<X2,X;>  +  •••  +  cN(xN,Xj >  =  0.  (2) 

The  left-hand  side  of  Equation  (2)  reduces  to  Cj(xj ,  x,- ).  Since  (xp  Xy>  >  0,  we 
see  that  Cj  =  0,  j  —  1,  2, . . . ,  N,  which  proves  Lemma  1. 

Another  benefit  of  working  with  orthogonal  bases  is  that  it  is  relatively 
easy  to  find  the  coordinates  of  a  vector  with  respect  to  a  given  orthogonal 
basis.  To  wit,  let  u1 ,  u2, . . . ,  u„  be  a  mutually  orthogonal  set  of  vectors  in  a  real 
n-dimensional  vector  space  V.  By  Lemma  1,  this  set  of  vectors  is  also  a  basis 
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for  V,  and  every  vector  x  in  V  can  be  expanded  in  the  form 


X  =  ct  ux  +  c2u2  +  •••  +  c„uM. 


Taking  inner  products  of  both  sides  of  Equation  (3)  with  u,-  gives  <x,u/)  = 
Cj^UpUj}  so  that 


<X,Uj) 


Example  1.  Let  V  =  R2  and  define  <x,  v)  =  x1y1  +  x2y2.  The  vectors 


“1  = 


1 

1 


and  = 


1 

1 


are  orthogonal  and,  thus,  form  a  basis  for  R2.  Consequently,  from  Equations 


(3)  and  (4),  any  vector  x  =  )  can  also  be  written  in  the  form 

*2, 


x 


X 


1 


1 


-Clllj+C2\-1 


<X,Ui>  ( l\+  <x,u2>  l  1 


(Ui.Uj)  \\)  '  <u2,u2>  V-l 

Xi  +  *2  /l\  Xi  —  X2  /  1 

2  (l)  +  2  V-l 


Given  the  benefit  of  working  with  orthogonal  bases,  the  following  question 
naturally  arises:  Does  every  n-dimensional  Euclidean  space  have  an  ortho¬ 
gonal  basis?  The  answer  to  this  question  is  a  resounding  yes  and  is  given  by 
the  following  theorem. 


Theorem  3  (Gram-Schmidt).  Every  n-dimensional  Euclidean  space  V  has  an 
orthogonal  basis. 


Proof.  Choose  a  basis  u1?  u2,...,u„  for  V.  We  will  inductively  construct  an 
orthogonal  basis  v1?  v2, . . . ,  v„  by  taking  suitable  combinations  of  the  vectors 
ux , . . . ,  u„.  To  this  end,  let  vx  =  ux  and  set 

v2  =  u2  +  Av1.  (5) 

Taking  the  inner  product  of  v2  with  vx  gives 

<*2,0  =  <U  2, Vi)  + 

so  that  v2  will  be  orthogonal  to  if  X  =  —(u2,Vi Note  that  v2 
cannot  equal  0  by  virtue  of  the  linear  independence  of  the  vectors  u1? . . . ,  u„. 
Proceeding  inductively,  let  us  assume  that  vl5  v2,...,vk  are  mutually  ortho- 
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gonal,  and  set 


vk+ 1  —  uk+i  +  ^ivi  +  *  *  *  +  ^kvk- 

The  requirement  that  \k+1  be  orthogonal  to  vx , . . . ,  \k  yields 

o  =  <Vn>v,>  =  <«*+i>y/>  +  A/v^v,.), 
which  gives  the  desired  relation 


With  this  choice  of  , . . . ,  Xk,  the  vectors  vx , . . . ,  \k,  vk+1  are  mutually  ortho¬ 
gonal.  Moreover,  vk+1  cannot  be  0  by  virtue  of  the  linear  independence  of 
, . . . ,  uk+1 .  Proceeding  inductively  until  k  =  n,  we  obtain  n  mutually  ortho¬ 
gonal  nonzero  vectors  vx , . . . ,  v„. 


Remark  1.  The  procedure  outlined  in  Theorem  3  is  usually  referred  to  as  the 
Gram-Schmidt  orthogonalization  procedure. 


Remark  2.  If  u1? . . . ,  uw  is  an  orthogonal  set  of  vectors,  and,  in  addition,  each 
of  the  vectors  u j  has  length  1,  i.e., 

IIUj-ll  =  <u,,  u j}112  =  1,  j  =  1, . . . ,  n, 

then  ul9 . . . ,  un  is  called  an  orthonormal  set  of  vectors.  It  is  a  simple  matter  to 
construct  an  orthonormal  basis  from  an  orthogonal  one:  if  u !,..., uw  is  an 
orthogonal  basis,  then 


are  orthonormal,  since 


u 


n 


Example  2.  Let  V  be  the  space  of  all  polynomials  of  degree  n  —  1,  and  define 


<J,9>  =  f(x)g(x)dx 
J  “I 

for  all  functions  /  and  g  in  V.  It  is  easily  verified  that  V  is  an  n-dimensional 
Euclidean  space  and  that 

/o(x)  =1,  fi(x)  =  X,  /„_  i(x)  =  X"-1 

form  a  basis  for  V.  Applying  the  Gram-Schmidt  orthogonalization  procedure 
to  the  functions  /0(x), . . .  t(x)  gives 


P  oM  =  i; 
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where 


/•i 


Pi(x)  =  x  +  X  =  x 


</l .  Po  > 
<Po,Po> 


xdx 


=  x  — 


rl 


-1 


=  x; 


1  dx 


p2(x )  =  x2  +  A2x  +  ^ 


ri 


A2=  -- 


x2-xdx 
~l  - =  0, 


n 


x2  dx 


rl 


x2  dx 


o  J  — i 

A1  —  m 


1  dx 


1 

3 


-l 


so  that 


with 


p2(x)  =  x2  -  ^ 

p3(x)  =  X3  +  4  (  X2  -  i  I  +  A2x  +  Ax 


A,  = 


/•l 


-1 


x3(x2  —  |)dx 


(X2  -  i)2  dx 


rl 


—  0,  X2  — 


x3  *  x  dx 


-l 


rl 


x2  dx 


3 

5’ 


-l 


and 


Ai  = 


n 

x3  dx 

_1 - =  0 


ri 


-l 


1  dx 


so  that 


p3(x)  =  x3  --X. 

These  polynomials  p0(x),  Pi(x),  p2(x),  p3(x),...,  when  properly  normalized, 
i.e.,  when  multiplied  by  a  suitable  constant,  are  the  Legendre  Polynomials 
Pk(x)  discussed  in  Chapter  2. 

We  now  digress  for  a  moment  to  discuss  the  following  seemingly  totally 
unrelated  problem.  Let  V  =  Cn  with  inner  product 

<x,  y>  =  (x,  y)  =  x1y1  +  x2y2  +  •  •  •  +  x„yn 

where  and  y1 , . . . ,  y„  are  the  coordinates  of  x  and  y,  respectively,  and 
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let  A  be  a  given  n  x  n  matrix.  Does  there  exist  a  matrix  B  such  that 

(Ax,  y)  =  (x,  By) 

for  all  vectors  x  and  y? 

To  answer  this  question,  we  let  A  =  (ay),  B  =  (h0)  and  observe  that 


n  /  n 

04x,y)=  X  X  atJXi  )^=  X 

j=i  \i=i  /  >,j=i 


n 


and 


W 


n 


n 


(x,  By)  =  X  x;  (  X  bijyj )  =  X 

i  =  l  \j  =  l  /  i,J  =  l 

Hence,  Equation  (7)  is  satisfied  if  and  only  if  b =  an  or,  equivalently, 


bu  =  an- 


(7) 


(8) 


Definition.  The  matrix  B  whose  elements  are  given  by  Equation  (8)  is  called 
the  adjoint  of  A  and  is  denoted  by  A*.  If  A*  =  A ,  then  A  is  said  to  be 
selfadjoint  or  Hermitian ,  and 

(Ax,  y)  =  (x,  Ay)  for  all  x  and  y. 

Remark  1.  If  A  is  real,  then  A*  is  simply  AT ,  the  transpose  of  A.  Thus,  if  A  is 
real  and  symmetric,  i.e.,  A  =  AT ,  then  A  is  Hermitian. 

Remark  2.  It  is  a  simple  matter  to  show  (see  Exercise  6)  that  (v4*)*  =  A  and 
that 

(x,  Ay)  =  (A*x,  y)  for  all  x  and  y. 


Example  3.  Let 


Then, 


'1  +  i 

2 

l 

A  = 

3 

• 

—  i 

1  —  i 

• 

l 

2  i 

3  i 

J 

'l-i 

3 

—  i 

A*  = 

2 

i 

-2  i 

• 

—  1 

1  + 

i  —  3  i 

Example  4.  Let 


A  = 


2 

3 

4 


3  4 

4  5 

5  6 
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Then  A*  =  AT  =  A. 

Selfadjoint  matrices  have  several  very  important  properties,  which  we  now 
wish  to  describe. 

Property  1.  Let  A  be  selfadjoint  and  let  x  be  an  eigenvector  of  A  with 
eigenvalue  X.  Since  A  is  selfadjoint,  (Ax,x)  =  (x,i4x),  but 

(v4x,x)  =  (Ax,x)  =  A(x,x), 

while 


(x,^4x)  =  (x,Ax)  =  A(x,x). 

Hence,  X  =  X. 

Corollary.  The  eigenvalues  of  a  real  symmetric  matrix  are  real 

Property  2.  Let  A  be  selfadjoint,  and  let  ul  and  u2  be  eigenvectors  of  A  with 
eigenvalues  Xx  and  X2,  respectively,  with  Xx  #A2.  Then,  ut  and  u2  are 
orthogonal,  i.e.,  (u^u^  =  0. 

Proof.  Observe  first  that 

(^U!,U2)  =  (AtUi.Uj)  = 

Since  A  is  selfadjoint, 

(i4ul9u2)  =  (u  l9Au2)  =  (uuX2u2)  =  X2(uuu2) 
since  X2  is  real.  Hence, 

ll(Ui,U2)  =  A2(U!,U2),  (9) 

and  since  Xx  ^  X2,  Equation  (9)  now  forces  (u1?u2)  =  0,  which  proves  Prop¬ 
erty  2. 

In  actuality,  we  can  state  a  much  stronger  property  than  Property  2:  Not 
only  are  the  eigenvectors  belonging  to  distinct  eigenvalues  of  a  selfadjoint 
matrix  orthogonal,  but  every  n  x  n  selfadjoint  matrix  posesses  a  full  comple¬ 
ment  of  n  mutually  orthogonal  eigenvectors.  This  is  the  content  of  Property  3. 

Property  3.  Every  n  x  n  selfadjoint  matrix  A  has  n  mutually  orthogonal 
eigenvectors. 

Proof.  Every  matrix  has  at  least  one  eigenvector.  Let  vx  be  an  eigenvector  of 
A  with  eigenvalue  Xu  and  let  W  be  the  subspace  of  V9  which  consists  of  all 
vectors  in  V  that  are  orthogonal  to  vlm  We  leave  it  to  the  reader  to  verify  (see 
Exercise  7)  that  IT  is  a  linear  subspace  of  V  of  dimension  n  —  1.  The  crucial 
observation  now  is  that  A  takes  W  into  itself ;  i.e.,  if  x  is  orthogonal  to  ,  then 
so  is  Ax.  But  this  follows  immediately  from  the  observation  that 

(Ax,  Vj)  =  (x,  A*Vi)  =  (x,  Avj)  =  (x^Vj)  =  A1(x,v1). 
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Thus,  (x^)  =  (^(Ax^)  =  0. 

The  next  part  of  our  proof  is  a  little  tricky.  Recall  from  Section  3.7  that 
every  n  x  n  matrix  A  induces  a  linear  transformation  #  on  Cn{Rn)  with 

&  (x)  =  Ax.  (10) 

Conversely,  every  linear  transformation  &  on  Cn(Rn)  can  be  written  in  the 
form  (10)  for  some  appropriate  matrix  A.  Let  &  be  the  linear  transformation 
induced  by  A *.  From  the  remarks  above,  &  takes  W  into  itself.  Thus,  assum¬ 
ing  that  (see  Exercise  8)  every  linear  transformation  that  takes  a  vector  space 
into  itself  has  at  least  one  eigenvector,  we  see  that  &  has  at  least  one  eigen¬ 
vector  v2  in  W;  i.e., 

&  (v2)  =  Ay  2  =  h2y  2 

for  some  number  X2-  Proceeding  inductively,  that  is,  by  considering  the 
subspace  of  W  perpendicular  to  v2,  we  can  produce  n  eigenvectors  v1?  v2, . . . , 
yn  of  A  that  are  mutually  orthogonal,  and  this  completes  the  proof  of 
Property  3. 

Corollary.  Every  symmetric  n  x  n  matrix  A  has  n  mutually  orthogonal  eigen¬ 
vectors. 

At  the  beginning  of  this  section  we  mentioned  that  orthogonal  bases  were 
often  very  useful  in  applications.  In  Chapter  3,  we  observed  that  a  basis  that 
consists  of  eigenvectors  of  a  matrix  A  is  also  very  useful  since  in  this  basis  the 
matrix  is  diagonal.  The  ideal  situation  in  practice  is  a  basis  of  eigenvectors 
of  a  selfadjoint  matrix  A ,  for  then  we  have  the  best  of  both  worlds,  i.e., 
the  eigenvectors  form  an  orthogonal  basis.  This  is  the  real  secret  behind  the 
famous  Sturm-Liouville  Theory  to  which  we  now  turn  our  attention. 


Exercises 

1.  Let  V  be  the  space  of  all  polynomials  of  degree  ^  2  on  the  interval  [  —  2,2],  with 
inner  product 

</>  9)  =  J_2  x2f(x)g(x)dx. 

Starting  with  the  basis  1,  x,  and  x2,  use  the  Gram-Schmidt  procedure  to  construct 
an  orthonormal  basis  for  V. 

2.  Let  V  be  the  space  of  all  polynomials  of  degree  <  n,  on  the  interval  [a,  b],  and  define 

</>0>  =  [  r(x)f{x)g(x)dx. 

J  a 

Use  the  Gram-Schmidt  procedure  to  obtain  an  orthogonal  basis  for  V  in  the  case 

(a)  ( a,b )  =  (- 1, 1), r(x)  =  |x|1/2; 

(b)  ( a,b )  —  (—oo,  oo ),  r(x)  —  e~x. 
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3.  Obtain  the  first  four  Hermite  polynomials  by  applying  the  Gram-Schmidt  proce¬ 
dure  to  the  monomials  1,  x,  x2,  and  x3,  with  weight  function  r(x)  =  e~x2/ 2. 

4.  Let  V  be  the  n-dimensional  space  spanned  by  the  n  functions  e~kx ,  k  =  0,  1,  . .., 
n  —  1,  and  let 


(/>  9)  = 


(•  00 


0 


f(x)g(x)  dx 


for  /,  $  in  V.  Obtain  an  orthogonal  basis  for  V. 

5.  Obtain  the  first  four  Laguerre  polynomials  by  applying  the  Gram-Schmidt  proce¬ 
dure  to  the  monomials  1,  x,  x2,  and  x3,  with  weight  function  r(x)  =  e~x. 


6.  Show  that  (A*)*  =  A. 


7.  Show  that  the  subspace  W  introduced  in  the  proof  of  Property  3  has  dimension 
n  —  1. 


8.  Show  that  every  linear  transformation  that  takes  a  vector  space  into  itself  has  at 
least  one  eigenvector. 


6.4  Sturm-Liouville  theory 

In  Chapter  5  we  studied  equations  of  the  form 

LW(x)  =  /(x)  +  Ay(x)  =  0  (1) 

subject  to  the  boundary  conditions 

a^iO)  +  b^'iO)  =  0,  a2y(l)  +  b2y'(l)  =  0.  (1') 

Equation  (1)  together  with  the  boundary  conditions  (1')  was  referred  to  as  a 
boundary  value  problem.  We  then  stated  a  theorem  to  the  effect  that  this 
boundary  value  problem  has  a  nontrivial  solution  only  for  a  special  set  of 
numbers 

<  A2  <  A3  <  •  •  •  <  Xn  <  •  •  • 

called  eigenvalues  and  every  differentiable  function  y(x)  can  be  expanded  in  a 
series  of  these  eigenfunctions,  i.e., 

00 

y(x)  =  Z  C»^W>  o <x<i. 

n=l 

For  example,  the  boundary  value  problem 

y"(x)  +  ly(x)  =  0,  y(0)  =  0,  y(l)  =  0,  (2) 

has  eigenvalues  /.„  =  n2n2jl2,  with  corresponding  eigenfunctions 


y„(x)  =  sin 


nnx 

~T’ 
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and  every  differentiable  function  f(x)  can  be  expanded  in  a  Fourier  series  of 
the  form 


y  (x)  =  X  cn sin  ~r  >  0  <  x  <  l. 

n= 1  * 

The  theorem  we  just  stated  is  a  special  case  of  a  much  more  general  theorem, 
which  applies  to  the  more  general  equation 

L[y]  (x)  =  a(x)y"(x)  +  b(x)y'(x)  +  c(x)y(x)  =  -Xr(x)y(x)  (3) 

and  the  boundary  conditions 

«ij;(a)  +  ^/(a)  =  0,  a2y(p)  +  b2y’(P)  =  0.  (3') 

We  can  gain  a  lot  of  insight  into  this  problem  by  studying  it  from  a  linear 
algebra  point  of  view.  To  wit,  let  V  be  the  space  of  all  twice  continuously 
differentiable  functions.  V  is  a  linear  vector  space  but  is  infinite  dimensional, 
rather  than  finite  dimensional.  Next,  define  an  inner  product  on  V  via  the 
relation 


(/» 9 )  =  f(x)g(x)  dx.  (4) 

Joe 

The  operator  L  defined  by 

L[y]{x)  =  a(x)y"(x)  +  b(x)y'(x)  +  c(x)y(x)  (5) 

is  a  linear  operator  defined  on  V  (see  Section  2.1)  and  it  is  natural  to  ask 
whether  L  is  self  adjoint  in  the  sense  that 

(Lm,  v)  =  (w,  Lv)  (6) 

for  all  functions  m,  v  in  V.  The  answer  to  this  question  is  given  in  Theorem  4. 


Theorem  4.  Let  V  be  the  space  of  all  twice  continuously  differentiable  functions 
on  the  interval  [a,/?].  Then ,  the  operator  L  defined  by  Equation  (5)  is 
selfadjoint  on  V9  in  the  sense  of  Equation  (6),  if  and  only  if 


b(x)  =  a'(x). 


Proof.  Computing 

rfi 

(Lu,  v)  —  [( a(x)u"(x )  +  b(x)u'(x)  +  c(x)u(xj]v(x)  dx 

Ja 

and 


(«,  Lv) 


rP 

[i a(x)v"(x )  -f  b(x)v'(x)  +  c(x)v(xj]u(x) dx 

a 


we  see  that 
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(Lw,  v)  —  (m,  Lv)  = 


P 


[a(x)v(x)u"  (x)  +  b(x)v(x)u'  (x)]  dx 


CL 


rP 


[i a(x)u(x)v"(x )  +  b(x)u(x)v'(xy\  dx. 


JOL 


Integrating  the  terms  containing  u”(x)  and  i/'(x)  by  parts  gives 


(Lw,  v)  —  (m,  Lv)  =  — 


+ 


+ 

+  a(x)v(x)u'(x)  |f  —  a(x)u(x)v'(x)  |f 
rp 


*p 

a(x)u'(x)v'(x)dx  — 

«  OL  ft 

rP 

a'(x)v(x)u'(x)  dx 

'a 

<*  p 

'P 

b(x)v(x)u'(x)dx  + 

Joe  * 

a(x)u'  (x)vf  (x)  dx 

oe 

rfi 

*P 

a'(x)u(x)v'(x)dx  — 

Joe  , 

b(x)u(x)v'(x)  dx 

a 

[b(x)  —  a'(x) ]  [i;(x)w'(x)  —  m(x)i/(x)]  dx 
+  a(x)[r(x)M'(x)  —  w(x)i/(x)]£. 


(8) 


The  boundary  terms  in  Equation  (8)  are  zero.  To  see  this,  assume  that  the 
constants  bu  b2  in  Equation  (3')  are  not  zero  (the  proof  is  trivial  if  they  are — 
see  Exercise  1),  and  set  cx  =  a1/b1,  c2  =  a2/b2.  Then 

a(x)[v(x)u'(x)  —  m(x)i/(x)]£ 

=  “(P)\ ~XP)u'(p)  -  u(j?)i?'(j8)]  -  a(a)[v(cL)u'{oi)  -  w(a)i/(a)];  (9) 
but 


U'(P)  =  c2u(p),  v'(p)  =  c2v{P)9  w'(a)  =  CiM(a),  i/(a)  =  c^a) 

since  m(x)  and  v(x)  satisfy  the  boundary  conditions  (3').  Hence,  the  right-hand 
side  of  Equation  (9)  becomes 

a(P)lv(P)c2u(P)  -  u(P)c2v(P)]  -  aWlvWc^ioc)  -  H^c^a)]  =  0. 
Thus,  we  have  shown  that 


rP 

( Lu ,  v)  —  (m,  Lv)  =  [ b(x )  —  a'(x)]  [y(x)u'(x)  —  u(x)v'(x) ]  dx 

Joe 


for  all  u  and  v  in  V.  The  right-hand  side  of  Equation  (10)  is  zero  if  b(x)  =  a'(x). 
Thus,  L  is  certainly  selfadjoint  in  the  sense  of  Equation  (6)  if  b(x)  =  a'(x).  In 
addition,  it  is  intuitively  clear  that  b(x)  =  af(x)  is  also  a  necessary  condition 
for  L  to  be  selfadjoint;  the  right-hand  side  of  Equation  (10)  cannot  vanish  for 
all  u  and  v  in  V  if  b(x)  ^  a'(x).  A  proof  of  this  assertion  is  outlined  in  the 
exercises. 
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In  summary,  the  boundary  value  problem  (3),  (3')  is  self-adjoint  if  and  only 
if  b(x)  =  a'(x).  Setting  a(x)  =  p(x)  and  c(x)  =  —  q(x),  we  can  now  write  all 
selfadjoint  boundary  value  problems  in  the  canonical  form 

=  [pWy'WI  -  q{x)y(x)  =  -ir(x)y(x)  (li) 

fliJ'(a)  +  &i/(a)  =  0,  a2y(P)  +  b2y'(P)  =  0.  (II') 

Equation  (11)  together  with  the  boundary  conditions  (IT)  is  often  referred  to 
as  a  Sturm-Liouville  boundary  value  problem. 

Remark  1.  In  the  notation  of  Equation  (11),  Equation  (8)  becomes 

(Lu,v)  —  (u,Lv)  =  —  p(x)[u'(x)v(x)  —  u(x)i/(x)]£.  (12) 

Equation  (12)  is  known  as  Lagrange’s  identity,  and  if  u  and  v  satisfy  the 
boundary  conditions  (11'),  then  Lagrange’s  identity  reduces  to 

(Lm,  v)  —  (w,  Lv)  =  0.  (13) 

Remark  2.  We  have  derived  Equation  (13)  in  the  case  that  u  and  v  are  both 
real,  because  that  is  how  Lagrange’s  identity  is  usually  expressed.  It  is  a  simple 
matter,  however,  to  verify  (see  Exercise  3)  that  Equation  (13)  is  also  true  for 
u  and  v  complex,  provided  we  define 

(f,g)=  f(x)g(x)  dx.  (14) 

J  a 

Definition.  A  Sturm-Liouville  boundary  value  problem  is  said  to  be  regular 
if  each  of  the  following  conditions  hold: 

(i)  r(x)  >  0  and  p(x)  >  0  for  x  e  [a,  /?]. 

(ii)  p(x),  p'(x),  g(x),  and  r(x)  are  continuous  on  [a,/?]. 

(iii)  (a,  P)  is  finite. 

We  are  now  ready  to  state  and  prove  the  main  theorem  of  this  chapter. 

Theorem  5.  For  any  regular  Sturm-Liouville  boundary  value  problem : 

(1)  All  the  eigenvalues  (and  consequently  all  the  eigenfunctions )  are  real. 

(2)  Eigenfunctions  belonging  to  different  eigenvalues  are  orthogonal  under 
the  inner  product 

</,0>=f  r(x)f(x)g(x) dx.  (15) 

(3)  To  each  eigenvalue ,  there  corresponds  one ,  and  only  one ,  eigenfunc¬ 
tion. 

(4)  There  are  a  countably  infinite  number  of  eigenvalues  A2, .  • .  with 
corresponding  eigenfunctions  y0(x)>  yi(xX  y2(xX---  •  The  eigenvalues  Xn  can 
be  ordered  so  that  n  refers  to  the  number  of  zeros  of  yn  in  the  interval  [a,  /?]. 
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In  addition ,  the  zeros  of  yn+1  interlace  those  of  yn.  Finally ,  X0  <  X1  < 
*2  <  V*  •  wztb  ->  oo  as  n  -+  oo. 

(5)  Let  /(x)  be  any  continuously  differentiable  function  on  the  interval 
[a,  /?].  T/zcn,  /(x)  can  be  expanded  in  a  convergent  series  of  the  eigenfunc¬ 
tions  of  L;  z.e., 


f(x)  =  X 
«  =  0 

where 


a 


n 


*P 

r(x)f(x)yn(x)  dx 
Ja 


K*)yn  (*)<** 


Proof.  (1)  The  proof  of  (1)  follows  immediately  from  the  selfadjointness  of  L: 
if  Ly  =  —  Xy  then 


(Ly,y)  =  ( -  Xry,  y)  =  -X 


rP 


J  a 


r(x)\y2(x)\dx 


and 


( y,Ly )  =  ( y ,  —  Ary)  =  -X  r(x)\y2(x)\dx  (18) 

Ja 

since  r(x)  is  real.  Equating  Equations  (17)  and  (18),  we  see  that  X  must  be  real. 

(2)  Let  yn(x)  and  ym(x)  be  eigenfunctions  of  L  with  eigenvalues  Xn  and  Xm, 
respectively,  with  Xn  /  Xm.  Then 


(Lyn,ym)  =  (-Xnryn,ym) 


n 


r(x)yn(x)ym(x)dx; 


Joe 


however,  since  L  is  selfadjoint, 


(Ly„,  ym)  =  (y„,  LyJ 

=  (y«,  -  *mrym) 


=  —X 


rfi 


m 


'•(x)y„(x)ym(x)  dx. 


J  a 


and  since  Xn  ^  Xm,  we  see  that 


rP 


r{x)yn(x)ym(x)dx  =  (yn,ym)  =  0. 


(3)  Suppose  that  ux(x)  and  u2(x)  are  two  independent  eigenfunctions 
with  eigenvalue  X.  Then  both  ux  and  u2  satisfy  the  second-order  differential 
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equation 


(p(x)/)'  -  q(x)y  =  -lr(x)y 


in  addition  to  the  boundary  conditions 

a1  y(cc)  +  fci/(a)  =  0,  a2y(P )  +  b2y'(p)  =  0.  (20) 

Since  ux  and  u2  are  independent,  they  form  a  basis  for  all  the  solutions  of 
Equation  (19),  that  is,  every  solution  y(x)  of  Equation  (19)  can  be  written  in 
the  form 


y(x)  =  CiU^x)  4-  c2u2(x)  (21) 

for  some  choice  of  constants  cl9  c2.  Finally,  observe  that  if  ux(x)  and  u2(x) 
satisfy  the  boundary  conditions  (20),  then  so  does  the  linear  combination  (21). 
Hence,  every  solution  of  Equation  (19)  must  satisfy  the  boundary  conditions 
(20),  which  is  clearly  absurd.  Thus,  to  each  eigenvalue  X  of  L,  there  cor¬ 
responds  only  one  eigenfunction. 

(4)  The  proof  of  (4),  though  not  very  difficult,  is  quite  long,  so  it  will  be 
omitted  here. 

(5)  The  convergence  part  of  (5)  is  too  difficult  to  present  here.  We  wish  to 
point  out  though,  that  if  f(x)  can  be  expanded  in  a  series  of  the  eigenfunctions 
yn(x),  i.e., 


OO 

/(*)  =  £  a„y„(x) 


n= 0 


then  an  must,  of  necessity,  be  given  by  Equation  (16).  To  see  this,  assume  that 
the  series  (22)  is  convergent  and  that  the  inner  product  of  /  with  any  function 
y(x)  can  be  distributed,  that  is, 

OO 

<f,y>  =  Z  a«<y«>y>-  (23) 

n= 0 

If  y(x)  =  yj(x),  then  Equation  (23)  reduces  to 

<f,yj>  =  ajiyj,yj')  (24) 

by  virtue  of  the  orthogonality  of  the  eigenfunctions  y0(x)>  )>i  (*)9  yi(xX  •  •  •  >  and 
it  now  follows  immediately  from  Equation  (24)  that 


a > 


<f,yj>  _ 

<yj*yj> 


r(x)f(x)yj(x)  dx 


a 


P 


r(x)yf(x)dx 


Part  (5)  of  Theorem  5  deals  with  pointwise  convergence,  i.e.,  the  series  (22) 
converges  for  each  x  in  (a,  /?)  if  f(x)  is  continuously  differentiable.  A  different 
type  of  convergence,  which  has  proven  quite  useful  in  applications,  is  known 
as  convergence  in  the  mean. 
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Definition.  A  sequence  of  functions  /„(x),  a  <  x  <  /?,  is  said  to  converge  in 
the  mean  to  /(x)  if 


11/ -/j  = 

approaches  zero  as  n  -►  0. 


rP 


r(x)(f(x)  -  fn(x))2  dx 


a, 


1/2 


Part  (5)  of  Theorem  5  now  has  the  following  analogue  in  terms  of  mean 
convergence. 


Theorem  6.  Let 


fn(x)  =  f,  ajy/x)  (26) 

J=  0 

where  aj  is  given  by  Equation  (25).  Then ,  fn(x)  converges  to  f(x)  in  the  mean 
for  all  functions  f(x)  with 


rP 


Joe, 


r(x)f2(x)dx  <  oo, 


i.e.,  for  all  functions  f(x)  with  finite  norm. 


Example  1.  Consider  the  Sturm-Liouville  boundary  value  problem 

Lly\  =  /  =  -Ay,  y\  0)  =  0,  /( 1)  =  0.  (27) 

Here  p(x)  =  r(x)  =  1  and  q(x)  =  0.  It  is  easily  verified  that 

y  =  y„(x)  =  cos  nnx,  n  =  0,  1,  2, ... , 

are  the  nontrivial  solutions,  i.e.,  eigenfunctions,  of  Equation  (27),  with  cor¬ 
responding  eigenvalues 

Xn  =  n2n2,  n  =  0,  1,  2, ... . 

Parts  (1)— (3)  of  Theorem  5  are  trivial  to  verify  and,  in  fact,  have  been  done 
already  in  Chapter  5.  The  zeros  of  yn(x)  are  located  at  the  n  points 

13  5  In  -  1 

2  n  2  n  2  n  2  n 

and  it  is  easily  verified  that  the  zeros  of  yn+1  interlace  those  of  yn.  For  example, 
the  four  zeros  |,  f ,  and  |  of  y4(x)  interlace  the  three  zeros  f ,  |  of  y3(x)  since 

1  1  3  3  5  5  7 

<  <  <  <  <  < 

8  6  8  6  8  6  8 

Finally,  part  (5)  of  Theorem  5  is  simply  Fourier’s  Theorem  of  Section  5.4. 

Let  us  return  now  to  the  Hermite,  Legendre,  Tchebycheff,  and  Laguerre 
differential  equations 
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y"  -  xy'  +  Ay  =  0, 

-00  <  X  <  00, 

(28) 

(1  —  x2)y"  —  2  xy'  +  X(X  +  l)y  =  0, 

-  1  <  X  <  1, 

(29) 

(1  -  x2)y"  -  +  X2y  =  0, 

-  1  <  X  <  1, 

(30) 

xy"  +  (1  —  x)y'  4-  Xy  =  0,  0  <  x  <  oo, 

(31) 

respectively.  Except  for  the  Legendre  equation  (29),  the  coefficient  of  y'  in 
each  of  these  equations  is  not  the  derivative  of  the  coefficient  of  We  remedy 
this  situation  by  multiplying  through  by  an  appropriate  integrating  factor 
(see  Section  1.2).  Then  we  can  rewrite  Equations  (28)— (31)  in  the  equivalent 
form 

£[y](x)  =  dxe  x  /2y'(x)  =  Xe  x  l2y(x). 

—  00  <  X  <  00, 

(28') 

£[y](x)  =  dx(l  -  X2)y'(x)  =  -X(X  +  l)y(x), 

—  1  <  X  <  1, 

(29') 

L[y](x)  =  dxV  ~  x2)1/2y'(x)  =  (1  _  x2)i/2 :»(*), 

—  1  <  X  <  1, 

(30') 

£[y](*)=  ,  xe  Xy'(x)  =  Xe  Xy(x), 
ax 

0  <  x  <  oo. 

(31') 

Indeed,  we  can  now  begin  to  understand  the  presence  of  the  factor  r(x)  in  the 
Sturm-Liouville  problem 

£[>]  (x)  =  -Ar(x)y(x). 

Our  next  step  is  to  determine  what  boundary  conditions  should  be  im¬ 
posed  on  the  solutions  of  Equations  (28)— (31).  To  this  end,  observe  that  none 
of  the  Equations  (28')— (31')  are  regular  in  the  sense  of  the  definition  preceding 
Theorem  5.  To  wit,  the  functions 

p(x)  =  1  —  x2  and  p(x)  =  (1  —  x2)1/2 

for  the  Legendre  and  Tchebycheff  equations  vanish  at  the  end  points  —  1  and 
+ 1  (which  in  a  limiting  sense  is  also  true  for  the  functions  e~x2/2  and  xe~x, 
which  occur  in  the  Hermite  and  Laguerre  equations),  and  the  interval  (a,/?) 
for  both  the  Hermite  and  Laguerre  equations  is  not  finite. 

Let  us  return  now  to  our  proof  of  Theorem  4  where  we  derived  the  relation 

(Lu,v)  —  ( u,Lv )  =  —p(x)[u'(x)v(x)  —  u(x)v'(x)]%.  (32) 

For  the  operators  L  appearing  in  Equations  (28)— (31),  the  integrals  appearing 
in  the  left-hand  sides  of  Equation  (32)  may  now  have  to  be  considered  as 
improper  integrals.  In  the  case  of  the  Legendre  and  Tchebycheff  equations, 
the  function  p(x)  vanishes  at  the  end  points.  Thus,  L  will  certainly  be  self- 
adjoint  if  we  impose  the  boundary  conditions  that  y'(x)  [and  hence  y(x),  see 
Exercise  12]  is  bounded  as  x  -►  ±  1.  In  the  case  of  the  Hermite  equation,  we 
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require  that 

lim  e~x2/2[u(x)v'(x)  —  u'(x)v(x) ]  =  0.  (33) 

X~*  ±00 

In  the  case  of  the  Laguerre  equation,  p(x)  vanishes  at  x  =  0.  Thus,  one 
boundary  condition  is  that  y'(x)  [and  hence  y(xj]  is  bounded  as  x  -►  0.  The 
second  boundary  condition  is  determined  from  the  requirement  that 

lim  xe~x\u(x)v'(x)  —  w'(x)i?(x)]  =  0.  (34) 

jc-+oo 

Finally,  we  note  that  the  conditions  (33)  and  (34)  translate  into  the  boundary 
conditions 

lim  e~x2,2y(x)  =  0  and  lim  xe~xy(x)  =  0 

X-*  ±  GO  X-*00 

for  the  Hermite  and  Laguerre  equations,  respectively. 

It  is  interesting  to  note  (but  difficult  to  prove)  that  the  boundary  conditions 
imposed  on  the  solutions  of  Equations  (28)— (31)  are  exactly  sufficient  to  force 
the  power  series  solutions  of  these  equations  to  terminate  at  a  finite  point.  In 
other  words,  the  only  solutions  of  Equations  (28)— (31)  that  satisfy  our  bound¬ 
ary  conditions  are  the  Hermite,  Legendre,  Tchebycheff,  and  Laguerre  poly¬ 
nomials,  respectively;  or,  in  the  language  of  linear  algebra,  the  only  nontrivial 
solutions  of  the  equation 

£[y](x)  =  -Ar(x)y(x) 

for  the  operators  L  in  Equations  (28')— (31')  and  the  boundary  condi¬ 
tions  discussed  above  are  the  eigenfunctions  y„(x)  with  eigenvalues  Xn,  and 
these  eigenfunctions,  when  properly  normalized,  are  the  Hermite,  Legendre, 
Tchebycheff,  and  Laguerre  polynomials,  respectively. 

Remark.  We  have  imposed  boundary  conditions  on  Equations  (28)— (31)  to 
ensure  that  the  respective  operators  L  are  selfadjoint.  This  will  guarantee  that 
parts  (1)  and  (2)  of  Theorem  5  are  true.  More  generally,  parts  (1)  and  (2)  of 
Theorem  5  are  true  even  for  singular  Sturm-Liouville  problems,  but  parts 
(3)-(5)  may  sometimes  fail  to  be  true,  even  though  they  are  true  for  the 
Hermite,  Legendre,  Tchebycheff,  and  Laguerre  equations. 

We  conclude  this  section  by  proving  that  the  only  nontrivial  solutions  of 
the  Hermite  equation 

y”  —  2 xy'  4-  Xy  =  0  or  (e~x2/2y'y  =  —  Xe~x2/2y  (35) 

satisfying  the  boundary  conditions 

lim  e~x2/2y(x)  =  0  (36) 

X-*  ±00 

are  the  Hermite  polynomials  Hn(x ),  which  are  the  polynomial  solutions  of  the 
equation 

y"  —  2  xy'  +  2  ny  =  0. 
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In  other  words,  all  nonpolynomial  solutions  of  Equation  (35)  fail  to  satisfy 
the  boundary  conditions  (36).  To  this  end  we  set 


00 

y(x)  =  £  a„x”. 

n= 0 

Computing 

oo  oo 

y'(x)  =  Yj  nanxn  l  and  y"ix)  —  X  n(n  ~ 

n=0  n=0 

and  plugging  into  Equation  (35)  yields 

00  00  oo 

£  (n  +  2)(n  +  1  )an+2x"  -  £  2mnxn  +  £  2anxn  =  0. 

n=0  n= 0  n= 0 

This,  in  turn,  implies  that 


(2n  -  X) 

(n  +  2)(n  +  1) 


For  the  moment,  let  us  set  ax  =  0  and  a0  =  1.  Then  Equation  (37)  forces  all 
the  odd  coefficients  a3,  a5, . . . ,  to  be  zero.  Next,  observe  that 


2  (n  -  Xjl) 
a"+2  “  (^+2)  (n  +  1)  a"‘ 


For  large  n, 


(n  -  A/2) 
(n  +  1) 


1. 


Thus,  for  n  >  N  sufficiently  large, 


a 


n+2 


> 


n  +  2 


n>  N. 


Next,  look  at 

00 

z(x)  =  £  bnx " 

n=0 

where 


Then, 


? 

n  +  2 


(.9)bn  and  b0  =  1. 
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2-4-6 


(•9)3  = 


and 


,  ,  ,  (-9)  2  (-9)2  ,  (.9)3  , 

z(x)  =  1  +  ~X2  +  -^7- X 4  +  ^~x6  + 


1! 


2! 


3! 


(  (.9x2)  (.9x2)2  (,9x2)3 

=  1  +~Z+  ..  + 


1! 


2! 


3! 


It  is  clear  from  Equations  (39)  and  (40)  that  for  A  #  2 n  (so  that  a„+2  is  never  0) 

y(x)  >  Pn(x)  +  e  9*2 

for  some  appropriate  polynomial  pN(x)  of  degree  N.  But  then 

lim  e~x2/2y(x)  =  lim  [ [e~x2/2pN(x )  +  eAx 2]  =  oo. 

X-^+OO  X-^+OO 

Hence,  the  only  way  y(x)  can  satisfy  the  boundary  condition  (36)  is  if  A  =  2 n, 
so  that  its  series  terminates. 

The  case  a0  =  0,  ax  =  1  is  similar,  and  we  leave  it  as  an  exercise  for  the 
reader  (see  Exercise  13). 


Exercises 

1.  Show  that  the  boundary  terms  in  Equation  (8)  are  trivially  zero  if  either  b1  or  b2 
is  zero. 

2.  Here  is  the  outline  of  a  proof  that  L  [see  Equation  (3)]  is  not  selfadjoint  if 
b(x)  a'(x).  Observe  first  that  if  b(x)  ^  a'(x)  and  we  can  choose  u  and  v  so  that 

v(x)u'{x)  —  u(x)v'(x)  =  b(x)  —  a'(x), 

then  the  integral  in  Equation  (10)  reduces  to 

f  [b(x)  —  a'(xj]2  dx  #  0. 

Ja 

The  only  problem  is  that  u  and  v  have  to  satisfy  the  boundary  conditions  (3').  To 
modify  the  proof, 

(a)  Observe  that 


v(x)u'(x)  —  u(x)v'(x)  =  i;2(x)^-J  . 

(b)  Choose  w,  v ,  and  constants  cl9c3,..., ck+1  (with  k  to  be  determined)  so  that 


u 


-  J  =c1[h(x)-fl'(x)]+c3[h(x)-a/(x)]3  +  C5[h(x)-u'(x)]5  + 

v 
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u  and  v  satisfy  the  boundary  conditions  (3'),  and  the  integral  in  Equation  (10) 
is  7*0. 

3.  Show  that  Equation  (13)  is  true  even  for  u,  v  complex. 

4.  Show  that  Liouville’s  transformation 

y  = 

reduces  Equation  (11)  to 

z"  —  f(t)z  =  —  kz. 

What  is  /(£)?  Note  that  under  this  transformation,  the  eigenvalues  remain  fixed 
and  the  weight  function  becomes  unity. 

5.  Let  Pn(x)  be  the  Legendre  polynomial  of  degree  n. 

(a)  Show  that  P^x)  satisfies  a  selfadjoint  equation  with  k  =  n(n  +  1)  —  2. 

c  * 

(b)  Show  that  P^(x)P^(x)(  1  —  x2)dx  =  0,  m  #  n. 

J -l 

6.  Find  the  eigenvalues  and  eigenfunctions  of  the  boundary  value  problem 

x2y"  =  -ky,  y(  1)  =  0,  y( 2)  =  0. 

(Hint:  Try  y  =  xp.) 

In  each  of  Problems  7-9,  find  the  eigenvalues  and  eigenfunctions  of  the  given 
boundary  value  problem. 

7.  /  +  Ay  =  0;  y(0)  =  0,  y(n)  -  y'(n)  =  0 

8.  y"  +  ky  =  0;  y( 0)  -  y'( 0)  =  0,  y(  1)  =  0 

9.  f  +  Ay  =  0;  y( 0)  -  y'(0)  =  0,  y(n)  -  y'(n)  =  0 

10.  For  which  values  of  k  does  the  boundary  value  problem 

/  -  2y'  +  (1  +  k)y  =  0,  y( 0)  =  0,  y(  1)  =  0, 
have  a  nontrivial  solution? 

11.  Show  that  the  singular  boundary  value  problem 

/  +  ky  =  0,  y(0)  =  y(2n\  y'(0)  =  y'(2n), 

has  a  continuum  of  eigenvalues. 

12.  Suppose  that  y'(0)  is  bounded  as  x  -*  ±1.  Show  that  y(x)  must  also  be  bounded  as 
x->  ±1.  [Hint:  Write  y(x)  as  an  integral  involving  y '(*).] 

13.  Let  a0  =  0  and  al  =  1.  Show  that  the  power  series  solution  a„xn  of  Equation 
(35),  with  an  determined  from  Equation  (37),  will  not  satisfy  the  boundary  condi¬ 
tions  (36)  unless  the  series  terminates,  i.e.,  k  =  2n  for  some  integer  n. 


G P(xMx ))1/4’ 


t'(x)  = 


m 

p(x) 
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Some  simple  facts  concerning  functions 
of  several  variables 

1.  A  function  f(x,y )  is  continuous  at  the  point  (^.Vo)  if  f°r  every  e>0 
there  exists  8  (e)  such  that 

\f(x,y)  -/( Wo)l  <  e  if  \x  -  X0\  +  \y  -y0\  <  8  (e). 

2.  The  partial  derivative  of  f(x9y )  with  respect  to  x  is  the  ordinary  de¬ 
rivative  of  /  with  respect  to  x,  assuming  that  y  is  constant.  In  other  words 

3/(Wo)  /(*o  +  h,y0)  ~f(x0,y0) 


3.  (a)  A  function  f(xv...,xn)  is  said  to  be  differentiable  if 


0/  3 f 

f (■*!  •  •  •  yXn  +  ~~f(x j, . . .  ,xn)  —  Axj  +  . . .  +  -0^- Axn  4-  e 


rn 


where  e/[|Ax1|  +  ...  +  |AxJ]  approaches  zero  as  IAxJh-  ...  -h  IAat^I  ap¬ 
proaches  zero,  (b)  A  function  f(xl9...,xn)  is  differentiable  in  a  region  R  if 
3//  3*i,  •  •  •  >  3//  8xn  are  continuous  in  R. 

4.  Let  /=/(*!,..., *„)  and  Xj  =  gj(yl9...9ym)9  y  =  l, ...,«.  If  /  and  g  are 
differentiable,  then 


df_ 

fa 


dl^L 

a*,  a yk ' 


This  is  the  chain  rule  of  partial  differentiation. 
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5.  If  all  the  partial  derivatives  of  order  two  of  /  are  continuous  in  a  re¬ 
gion  R ,  then 


a  2f  a2/ 

-  =  -  •  i  Is  =  1  Yl 

dx/dxk  a  xkdxj’  J 


6.  The  general  term  in  the  Taylor  series  expansion  of  /  about  x,  = 
x\,...,xn  =  x%  is 
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Sequences  and  series 

1.  A  sequence  of  numbers  an,  n  =  1,2,...  is  said  to  converge  to  the  limit 
a  if  the  numbers  an  get  closer  and  closer  to  a  as  n  approaches  infinity. 
More  precisely,  the  sequence  ( an )  converges  to  a  if  for  every  6  >0  there  ex¬ 
ists  N(e)  such  that 

|  a  —  an\<e  if  n>N(e). 

2.  Theorem  1.  If  an-> a  and  bn-^>b,  then  an  ±  bn-^>a  ±  b. 

Proof.  Let  e>0  be  given.  Choose  N{(e),N2(e)  such  that 

\a-an\< ~  for  n>Nl(e),  and  \b-bn\<^  for  n>N2(e). 

Let  Ar(e)  =  max{A1(e),Ar2(e)).  Then,  for  n  >  N(e ), 

\an±bn~{a±b)\<\an- a\  +  \bn~b\<^  +  \  □ 

3.  Theorem  2.  Suppose  that  an+l>  an,  and  there  exists  a  number  K  such  that 
\an  \  <  K,for  all  n.  Then ,  the  sequence  (an)  has  a  limit. 

Proof.  Exactly  the  same  as  the  proof  of  Lemma  1,  Section  4.8.  □ 

4.  The  infinite  series  ax  +  a2  +  . . .  =  2  an  is  sa^d  to  converge  if  the 
sequence  of  partial  sums 

sx  =  ax,  s2  =  a{  +  a2,  ...,  sn  =  ax  +  a2+ ...  +an9... 

has  a  limit. 
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5.  The  sum  and  difference  of  two  convergent  series  are  also  convergent. 
This  follows  immediately  from  Theorem  1. 

6.  Theorem  3.  Let  an  >  0.  The  series  2  an  converges  if  there  exists  a  number 
K  such  that  ax  +  . . .  +  an  <  K  for  all  n. 

Proof.  The  sequence  of  partial  sums  sn  =  ax+  ...  +  a„  satisfies  sn+x>  sn. 
Since  sn  <  K ,  we  conclude  from  Theorem  2  that  2  an  converges.  □ 

7.  Theorem  4.  The  series  2  an  converges  if  there  exists  a  number  K  such 
that  \ax\  +  . . .  +  \an\  <  K  for  all  n. 

Proof.  From  Theorem  3,  2  \an\  converges.  Let  bn  =  an  +  \an\.  Clearly, 

0  <  bn  <  2|<zJ.  Thus,  2  bn  also  converges.  This  immediately  implies  that  the 
series 

[K~\an\] 

also  converges.  □ 

8.  Theorem  5  (Cauchy  ratio  test).  Suppose  that  the  sequence  \an+l/an\  has 
a  limit  X.  Then ,  the  series  2  an  converges  if  X<  1  and  diverges  if  X>  1. 

Proof.  Suppose  that  A<  1.  Then,  there  exists  p<  1,  and  an  index  N ,  such 
that  \an+l  \  <  p\an\  for  n  >  N.  This  implies  that  \aN+p\  <  pp\aN\.  Hence, 

n+k  I  a  I 

2  |^|<(l+P+---+P^)kjv|<T~", 

n  =  N  1  P 

and  2  an  converges. 

If  X>  1,  then  \aN+p\  >pp\aN\,  with  p>  1.  Thus  \an\->oo  as  n-> oo.  But  \an\ 
must  approach  zero  if  2  an  converges  to  s,  since 

1^/1+  ll  1*^1+  1  ^ I  1*^/1  *^1 

and  both  these  quantities  approach  zero  as  n  approaches  infinity.  Thus, 
2  an  diverges.  □ 
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C  Programs 

C  Program  1 

#  include  <stdio.h> 

#  include  <math.h> 

int  main(  ) 

{ 

double  X[200]  =  {1}; 
int  k,  N  =  15; 

printf(“  N%10sX[N]\n”,  “  ”); 
for  (k  =  0;  k<  =  N;  k  +  +) 

{ 

printf(“%4d%4s%17.91f \n”,  k,  “  ”,X[k]); 
X[k  + 1]  =  0.25+sin(X[k]); 

} 

} 
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C  Program  2 

#  include  <stdio.h> 

int  main(  ) 

{ 

double  X[200]  =  {1.4}; 
int  k,  N  =  5; 

printf  (“  N%10sX[N]\n”,  “  M); 

for  (k  =  0;  k<  =  N;  k  +  +) 

{ 

printf(“%4d%4s%17.91f\n”,  k,  “  ”,X[k]); 
X[k+ 1]  =  X[k]/2+  1/X[k]; 

} 

} 


C  Program  3 

#  include  <stdio.h> 

#  include  (rnath.h) 

int  main(  ) 

{ 

double  a,  d,  V[200]  =  {4 5.7}; 

double  c  =  0.08,  g  =  32.2,  W  =  527.436,  B  =  470.327; 

int  k,  N  =  15; 

a  =  (W  — B)/c; 
d  =  300*c*g/W; 

printf  (“  N%10sX[N]\n”,  “  ”); 

for  (k  =  0;  k<  =  N;  k  +  +) 

{ 

printf(“%4d%4s%17.91f\n”,  k,  “  ”, V[k]); 
V[k  + 1]  =  V[k]  +  ((a- V[k])/V[k]) 

*  (V[k]  +  d  +  a  *  log(1  -  (V[k]/a))); 

} 
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C  Program  4 


#  include  <stdio.h> 

#  include  <math.h> 

int  main(  ) 

{ 

double  T[1000]  =  {0},  Y[1000]  =  {0.5},  a  =  1,  h; 
int  k,  N  =  10; 

h  =  a/N; 

for  (k  =  0;  k<N;  k  +  +  ) 

{ 

T[k  +  1]  =  T[k]  +  h; 

Y[k  + 1  ]  =  Y[k]  +  h  *  (1+  pow(Y[k]  -T[k],  2.0)); 

} 

printf(“%4s%16s\n”,  “T”,  “Y”); 
for  (k=0;  k<  =  N;  k  +  +  ) 

printf  (“%10.71f%2s%16.91f\n”,T[k],  “  ”,  Y[k]); 

} 

C  Program  5 


#  include  <stdio.h> 

#  include  <math.h> 

int  main(  ) 

{ 

double  T[1000]  =  {0},  Y[1000]  =  {0.5},  a  =  1,  h; 
int  k,  N  =  10; 

h  =  a/N; 

for  (k  =  0;  k <  N;  k  +  +  ) 

{ 

T[k  +  1]=T[k]  +  h; 

Y[k+ 1]  =  Y[k]  +  h  *  (1  +  pow(Y[k]  — T[k],  2.0)) 

+  h  *  h  *  pow(Y[k] — T[k],  3.0); 

} 

printf(“%4s%16s\n”,  “T”,  “Y”); 
for  (k=0;  k<  =  N;  k  +  +  ) 

printf  (“%10.71f%2s%16.91f\n”,  T[k],  “  ”,Y[k]); 


551 


Appendix 


C  Program  6 


#  include  <stdio.h> 

#  include  <math.h> 

int  main(  ) 

{ 

double  T[1000]  =  {0},Y[1000]  =  {0.5},  a  =  1,  h,  R; 
int  k,  N  =  10; 

h  =  a/N; 

for  (k  =  0;  k<N;  k  +  +  ) 

{ 

R  =  1  +  pow(Y[k] — T[k],  2.0); 

T[k  +  1]=T[k]  +  h; 

Y[k  +  1]  =  Y[k]  +  (h/2)  *  (R  +  1 

+  pow(Y[k]  +  h*R-T[k  +  1],2.0)); 

} 

printf(“%4s%16s\n”,  “T”,  “Y”); 
for  (k  =  0;  k  <  =  N;  k  +  + ) 

printf(“%10.71f%2s%16.91f\n”,T[k],  “  ”,Y[k]); 

} 

C  Program  7 


#  include  <stdio.h> 

#  include  <math.h> 

int  main(  ) 

{ 

double  T[1000]  =  {0},Y[1000]  =  {0.5},  a  =  1,  h; 
double  LK1,  LK2,  LK3,  LK4; 
int  k,  N  =  10; 

h  =  a/N; 

for  (k  =  0;  k<N;  k  +  +  ) 

{ 

T[k  +  1]=T[k]  +  h; 

LK1  =  1  +  pow(Y[k] — T[k],  2.0); 

LK2  =  1+  pow((Y[k]  +  (h/2)  *  LK1)  —  (T[k]  +  h/2),  2.0); 
LK3  =  1  +  pow((Y[k]  +  (h/2)  *  LK2)  -  (T[k]  +  h/2),  2.0); 
LK4  =  1+  pow((Y[k]  +  h  *  LK3)  -  (T[k]  +  h),  2.0); 

Y[k+ 1]  =  Y[k]  +  (h/6)  *(LK1  +LK4  +  2*  (LK2  +  LK3)); 

} 

printf(“%4s%16s\n”,  “T”,  “Y”); 
for  (k  =  0;  k<  =  N;  k  +  +  ) 

printf  (“%10.71f%2s%16.91f \n”,  T[k],  “  ”,  Y[k]); 
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C  Program  8 

#  include  <stdio.h> 

#  include  <math.h> 

int  main(  ) 

{ 

double  T[1000]  =  {0},  Y[1000]  =  {0},  a  =  1,  h; 
int  k,  N  =  10; 

h  =  a/N; 

for  (k  —  0;  k  <  N;  k  +  +  ) 

{ 

T[k+1]=T[k]  +  h; 

Y[k  + 1]  =  Y[k]  +  h  *  (Y[k]  *  (1  +  exp(— Y[k]))  +  exp(T[k])); 

} 

printf(“%4s%10s%20s\n”,  “N”,  “h”,  “Y[N]”); 
printf  (“%4d%2s%10.71f%20.101f\n”,  N,  “  ”,h,Y[N]); 


C  Program  9 

#  include  <stdio.h> 

#  include  (math.h) 

int  main(  ) 

{ 

double  T[1000]  =  {0},  Y[1000]  =  {0},  a  =  1,  h; 
double  DY1,  DY2; 
int  k,  N  =  10; 

h  =  a/N; 

for  (k  =  0;  k<N;  k  +  +  ) 

{ 

T[k+1]=T[k]  +  h; 

DY1  =1  +  (1  -  Y[k])  *  exp(  — Y[k]); 

DY2=Y[k]  *(1  +exp(  — Y[k]))  +  exp(T[k]); 

Y[k+ 1]  =  Y[k]  +  h  *  DY2  +  (h  *  h/2)  *  (exp(T[k])  +  DY1  *  DY2); 

} 

printf  (“%4s%10s%20s\n”,  “N”,  “h”,  “Y[N]”); 
printf(“%4d%2s%10.71f%20.101f\n”,  N,  “  ”,h,Y[N]); 
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C  Program  10 


#  include  <stdio.h> 

#  include  <math.h> 

int  main(  ) 

{ 

double  T[1000]  =  {0},  Y[1000]  =  {0},  a=1,  h; 
double  R1,  R2; 
int  k,  N  =  10; 


} 


h  =  a/N; 


for  (k  =  0;  k<N;  k  +  +  ) 

{ 

T[k  +  1]=T[k]  +  h; 

R1  =Y[k]*(1+exp(-Y[k]))  +  exp(T[k]); 

R2  =  (Y[k]  +  h  *  R1)  *  (1  +  exp(  — (Y[k]  +  h  *  R1)))  +  exp(T[k+ 1]); 
Y[k  + 1] = Y[k]  +  (h/2)  *  (R1  +  R2); 

} 

printf  (“%4s%10s%20s\n”,  “N”,  “h”,  “Y[N] ”); 

printf (“%4d%2s%10.71f%2s%18.101f\n”,  N,  “  ”,  h,  “  ”, Y[N]); 


C  Program  11 


#  include  <stdio.h> 

#  include  <math.h> 

int  main(  ) 

{ 

double  T[1000]  =  {0},  Y[1000]  =  {0},  a  =  1 ,  h; 
double  LK1,  LK2,  LK3,  LK4; 
int  k,  N  =  10; 


h  =  a/N; 

for  (k  =  0;  k <  N;  k  +  +  ) 

{ 

T[k  +  1]=T[k]  +  h; 

LK1  =  Y[k]  *  (1  +exp(- Y[k]))  +  exp(T[k]); 

LK2  =  (Y[k]  +  (h/2)  *  LK1)  *  (1  +  exp(  -  (Y[k]  +  (h/2)  *  LK1))) 
+  exp(T[k]  +  (h/2)); 

LK3  =  (Y[k]  +  (h/2)  *  LK2)  *  (1  +  exp(  -  (Y[k]  +  (h/2)  *  LK2))) 
+  exp(T[k]  +  (h/2)); 

LK4  =  (Y  [k]  +  h  *  LK3)  *  (1  +  exp  ( -  (Y  [k]  +  h  *  LK3))) 

+  exp(T[k  +  1]); 
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Y[k+1]  =  Y[k]  +  (h/6)  *  (LK1  +  2*  LK2  +  2*  LK3  +  LK4); 

} 

printf  (“%4s%10s%20s\nM,  “N”,  “hM,  “Y[N]”); 

printf  (“%4d%2s%10.71f%2s%18.101f\n”,  N,  “  ”,  h,  “  ”,  Y[N]); 


C  Program  12 


#  include  <stdio.h> 

#  include  <math.h> 

intmain(  ) 

{ 

double  T[1000]  =  {0},  Y[1000]  =  {1},  a  =  1,  h; 
double  LK1,  LK2,  LK3,  LK4; 
int  k,  N  =  10; 


h  =  a/N; 

for  (k  =  0;  k<N;  k  +  +  ) 

{ 

T[k  +  1]  =  T[k]  +  h; 

LK1  =T[k]  *T[k]  +  Y[k]  *  Y[k]; 

LK2  =  pow(T[k]  +  h/2, 2.0)  +  pow(Y[k]  +  (h/2)  *  LK1 , 2.0); 
LK3  =  pow(T[k]  +  h/2, 2.0)  +  pow(Y[k]  +  (h/2)  *  LK2, 2.0); 
LK4  =  pow(T[k]  +  h,  2.0)  +  pow(Y[k]  +  h  *  LK3, 2.0); 

Y[k  + 1  ]  =  Y  [k]  +  (h/6)  *  (LK1  +  2  *  LK2  +  2  *  LK3  4-  LK4); 

} 

printf(“%4s%15s\n”,  “T”,  “Y”); 
for  (k=0;  k<  =  N;  k  +  +  ) 

printf (“ %  1 0.71f  %2s%  16.91f \n”,  T[k],  “  ”,Y[k]); 


C  Program  13 


#  include  <stdio.h) 

# include  <math.h> 

# define  PI  3.141592654 

int  main(  ) 

{ 

double  T[1000],  Y[1000],  h; 
int  k,  N  =  25; 

h  =  2.0/(double)N; 

T[1]  =  h; 
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Y[1]  =  0; 

for  (k  =  1 ;  k <  N;  k  +  +  ) 

{ 

T[k+1]=T[k]  +  h; 

if  (Y[k]  =  =  0)  Y[k+ 1]  =  h  *  T[k]  *  sin (Pl/T[k]);  else 
Y[k  + 1]  =  Y[k]  +  h  *  (Y[k]  *  pow(fabs(Y[k]),  -0.75) 

+  T[k]  *sin(PI/T[k])); 

} 

printf(“%4s%15s\n”,  “T”,  “Y”); 
for  (k  =  1 ;  k<  =  N;  k  +  +  ) 

printf(“%10.71f%2s%16.91f\n”,T[k],  “  ”, Y[k]); 


C  Program  14 


#  include  <stdio.h) 

#  include  <math.h> 

int  main(  ) 

{ 

double  A[200]  =  {1, 1},  T  =  0.5,  sum; 
int  k,  N  =  20; 

A[2]=  —  0.5  *  (A[1  ]  +  A[0]); 
sum  =  A[0]  +  A[1]*T  +  A[2]*T*T; 
for  (k  =  1 ;  k<  =  N  —  2;  k  +  +  ) 

{ 

A[k  +  2]  =  ((k  +  1)*(k— 1)  *  A[k  +  1]  — A[k]  +  A[k  — 1]) 
/((k+1)*(k  +  2)); 

sum  =  sum  +  A[k+2]  *pow(T,  (double)(k  +  2)); 

} 

printf(“For  N  =  %d  and  T  =  %6.41f\n”,  N,  T); 
printf (“the  sum  is:  %11.91f\n”,sum); 
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odd-numbered 

exercises 


Chapter  1 

Section  1.2 


1.  y(t)  =  ce 


—  „-sin/ 


3.  y(t)  = 


t  +  c 


11.  y(t)  = 


1  +  l2 

c  +  f(i+t2y/2(\+t4y/4dt 

(i+i2),/2(i+/4)l/4 

3e'2—  1 


5.  >-(r)  =  exp(  -  j  /3)[  fexp(\  t3)dt  +  c] 


9.  y(/)=exp(  -  jf'VTT 


s2  e  Sds 


) 


13.  y(t)  =  e~* 


2e+  V- 
J  i  1 


+  S‘ 


ds 


15.  7(0 = ( y  +  \  + t  +  c)o  +  t2r 1/2  17.  7(0= 

21.  Each  solution  approaches  a  distinct  limit  as  /->0. 
23.  All  solutions  approach  zero  as  t-^ir /2. 


2(1-0,  0</<l 

2(e-\)e~‘,  t>  1 


Section  1.3 

3.  127,328  5.  (a)  N23S(t)=  N2is(0)2-l°-,,/4-5-(b)  N^-NjdSSp.-*-*''-™ 

7.  About  13,550  b.c. 

Section  1.4 


1-7(0= 

7.^(0  = 


t  +  c 
1  —  ct 

9  +  21n 


3.  7(0  =  tan(/-^/2H-c) 


2  \  11  2 


—  oo<f<oo 


5.  .y(0=arcsin(csin/) 
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9.  >-(/)=  l-[4+2/+2f2  +  /3]l/2,  —2<t<oo 

n-yit)=rnk’  ^<t<0O’ifa=b 

abl  1 -«?*<*-*>']  .  a 

y(t)= - 77i — ; — .  tv, - rln-r<<<oo;  a^b 

a-bekib~a)l  k(b  —  a)  b 


ct‘ 


t2-\ 


13.  (b)y(t)=  - 1  and  y(t)=  15.  y(i)=  — j 


17.  y  =  c2e~2^,  t>0;y=-c2e2^ ,  t< 0  19.  t+ye,/y  =  c 

21.  (b)  \(b  +  c)(at  +  by)  +  an  +  bm\  =  ke(b+c^a,~cy^ 

23.  (t+2y)2  —  (t+2y)  =  c  — It 


Section  1.5 
3.  a  >0.4685 


7. 


(a)  =  0.003/>  —  0.00  lp2  —  0.002;  (b)  p(t)  = 

p(t)—*2  as  t— *<x> 


1,999,998  -  999,998e  -°M,r 
999, 999  -  999, 998^-°°°''  ’ 


Section  1.6 

NecNl 


1.  p{t)= 


N-\  +  e 


cNl 


Section  1.7 
!•  V(t)= 


7.  F(I)=V322 


[1  —  e(~cg/fV)t  j 


AT0e(64'4,('-5)/V322  +  j 

K0e(64A)(t~5)/V ***  - 1 


V322 


K 


(■-£) 


+ 1 


o 


9.  (.)  VP  -  We  +  « I. 

c  mg-cVVo 

11.  (a)  c  J  =  -Z~--2;  (b)  K0=  yp2gR 
ay  ( \i  a - 1?  v 


V322  (l - l- 

\  Ve 

-ct  ^  T,  «¥ 
1^-;(b)Fr=-r 


)- 


Section  1.8 

3.  yrs.  5.  lOV0  7.  (a)  c(/)=l  —  t-0001';  (b)  1000 In |  min. 

9.  c(0  =  i(l-e-002')+^e-°02'  11.48%  13.  jcy  =  c 

15.  x 2  +y 2  =  cy  17.  ,y2(lnp2 —  1) = 2 (c  —  jc2) 
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Section  1.9 

3.  t2  sin y  +  y3e‘  =  c  5.  y  tan  /  +  sec  t  +  y2  =  c  7.  y(t)  =  /~2/3 
9.  y(t)  =  -  t1  +  V/4  -  (/3  -  1)  11.  Py  +  =  10 


13.  a=  —  2; y(t)  =  ± 


1(2/-  1) 
2(l+cf) 


1/2 


19.  a=  1;  >>(i)  =  arcsin[(c-/)e'] 


Section  1.10 

#4  j.6  *2  n 

!•  7*(0=s'2  +  2?  +  3T  +  *  *  *  +  T^f 

3.  7i(0  =  e'-l;  y2(t)  =  t-e‘+  1  - 

..  107  .  t  ,  r2  ,  t3  ,  ,  0  +  0*2' 

^(0  48  +  4  +  2  +  3  +  2 

A 

19.  >^(0  =  siny 


Section  1.11 

3.  (a)  0<  x0<  10  5.  1.48982  7.  (c)  0.73909 


Section  1.11.1 

5.  0.61547  7.  0.22105  9.  1.2237 


Section  1.12 


1.  7* =  ( ■ -  V  ■ -  i K ■ -  7)"  ■ -  1 1  3.  (a)  En  <  i(3w  -  1 );  (b)  £„  <  2(2"  -  1) 


3*  7n  ^1  •  •  •  &n  —  1 


n~\  b; 

*  +  2 


.  Cl\  ...  Cl; 
7  =  1  *  7 

9.  (a)  jc*  $25 1.75;  (b)  x  =  $289.50 


7-  725“ 


25 


24 

1+  2  * 

7=1 


Section  1.13 

1.  /?=0.1,710=1;  =  0.025, =  1 

3.  /z  =  0.1, 710=  1;  h  =  0.025, 740=  1 
5.  /1  =  0.1, 7 10=1. 805 16901;  A  =  0.025,  yA0  =  1 .94633036 


Section  1.13.1 

3/z  „/<  /*(l  +  e  +  e2) 

1.  £*<  4^(e4/5-l)  3.  Ek  <  — --- ~2e - -[ee^e+l)-\] 


5. 


4(10-5) 

3(e2-l) 


Section  1.14 

1.  h=0A,yw=  1;  /i  =  0.025, ^40=  1  3.  A  =  0. 1, >'10=  1;  /i=0.025,  >>40=  1 

5.  /i=0. l,>'i0=2;  h  =  0.025,  y40=2 
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Section  1.15 

1.  A=0.1,/i0=l;  /i= 0.025, >>40=  1  3.  A=0.1,.yio=l;  A =0.025, >>40=  1 

5.  A =0.1,  _y10=  1.98324929;  A  =0.025,  >>40=  1.99884368 

Section  1.16 

1.  A=0.1,>'io=  1;  A=0.025, >>40=  1  3.  A=0.1,^|O=l;  A=0.025, ^40=  1 

5.  A  =0.1,  >>,„=  1.99997769;  A = 0.025,  y#  =  1 .9999999 


Section  1.17 

1.  2.4103  3.  0.0506  5.  0.4388 


Chapter  2 

Section  2. 1 

1.  (a)  (4— 3/)e';  (b)  3V3  fsinV3  t;  (c)  2(4-3/)e'+  12V3  /sinV3  t ; 

(d)  2-3 12;  (e)  5(2 -3r2);  (f)  0;  (g)  2-3r2 

5.  (b)W=^;(d)y(t)  =  2Vt 

7.  (a)  — (AsnwrtsinAH-acosa/cosA/);  (b)  0;  (c)  (A  —  a)e(-a+b)‘;  (d)  e2a';  (e)  t; 
(f)  —be7*' 

13.  w=j 

Section  2.2 

I.  y(t)  =  cte‘  +  c2e~‘  3.  ^(/)“C3'/2[ciev^,/2  +  c2e-v^'/2J 

5.  y(0- I(e4'+4e-’)  7.  ,(,)=  ^-e"'/2[e3V3 

9.  V  >  -3  11.  y(t)  =  clt  +  c2/t5 


Section  2.2.1 
1.  y{t)  —  e~ t/2 


V3  t  .  .  V3 t 

c,cos— ^ — hc2sin— j— 


3.  y{t)  =  e  cos  V2  /  +  c2  sin  V2 
5.  y(t)  =  e~t/2\ cos^  * 


9.  y( t)  =  e('-2)/3 


cos 


V7 

VTT  (t-2) 


3  .  V7  / 

sin— r— 


4  .  VTT(/-2) 

sm - z - 


VTT 


11.  yM  =  cos  cot,  y2(t)  =  sin  cot 
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Answers  to  odd-numbered  exercises 


15.  v7  = 


(1  +  0 


;  VT+7  =  —  [ VV2  +1  +/VV 2  -1  1 


V2 


V2 

V^l=-~ ;  Vvr  =  ±  [  V  V2  —  1  +/VV2+1  ],  (Vi  =ei,T/4) 


19.  >>(0  = 


1 

Vt 


V2 

V7  ,  .  V?  ,  . 

c,  cos  -y-  In  t  +  c2  sm  — In  t 


Section  2.2.2 

\.  y(t)  =  (cx  + c2t)e3'  3. /(0  =  (l  +  j0e_'/3  7.  y(t)  =  2(t-ir)e^'-^3 

11.  y(t)  =  (c,  +  c2t)e '2  13.  y(t)  =  cxt  +  c2(t2  —  1)  15.  y(t)  =  cx(t  +  1)  +  c2e1' 

,  c,  +  c2ln/ 

17.  y{t)  —  Cj(l  +3/)  +  c2e3t  19.  >>(/)  = - - - 


Section  2.3 

1.  y(t)  =  cx  +  c2e2t  +  t2  3.  >>(/)  =  e'2  +  2<?'-2<?~'3 


Section  2.4 


1.  y(t)  =  (c]  +  \ncost)cost  +  (c2  +  t)sint 

3.  >>(/)=c,e'/2  +  [c2  +  9/-2/2  +  j/3]e' 

5.  y (t)  =  -^(2  cos t  —  3  sin  /)e  “ '  —  e  ~ '  +  e  “ ///3 

7 .  y(t)  =  J  VJ-±\  [e2{t~~s)  —  e^~s)]ds  9 •  >>(/)  =  c/2+ y  + 


11.  y(t)=V~t 


c1  +  c2ln/  +  J*  /Vs  coss[ln/ —  lns]<& 


Section  2.5 

1.  W)-  3.  +  S.m=je~' 

7.  \p(t)=  r [sin 2/  —  2/cos2/]  9.  ip(t)  =  j  +  ^(cos2/  —  4sin2/) 

11.  i//(/)=  yy  (cos/  +  7sin/)-i-^  -  13.  \p(t)  =  t(e2t  -  el) 

15.  ip(t)=  —  cos  3/  4-  ^/sin/ 

17.  (b)  WO  =  0.005+  ^2o^-Qjg-(15sin30/  — 20,000cos300+  y  sin  10/ 
Section  2.6 


1.  Amplitude  =  J,  period  =  frequency  =  8 
7.  a  >  /?,  where  [l+(l-/?)2]e-2/*=  10~6 
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Answers  to  odd-numbered  exercises 


9.  >>(/)  = 


e-(t-v) 


11.  m  /  2  seconds 


/  ]  (-77  + 1)  \  - 

(  2  H - 2 — e  "  jcos(^~  tt)+  2e  wsin(/  —  7r) 


Section  2.6.2 

3.  e(l)=J4  1  —  ( cos 500 V3  +  — 
10*[  v  V3 

3 

Steady  state  charge  =  25q  qqq 

_  _/  1  _  R2  \'/2 

1'LC  2  L2] 


j=r  sin 500 V3  )e~500 


Section  2.8 


1-7(0  =  a0e  ,2/2  +  a]  2 


OO 


(-1)< 


n<2n  +  1 


„=o  3-5...(2n+  1) 


3.  >’(/)  =  a0 


3 12  3t 4 

1  +  ^V  + 


3/6  3-3/8 


22  24-  2!  26-3!  28-4! 


3-3-5/ 

2l0-5! 


10 


+  .  . 


OO 


5.^(0=  2  (#i+ix'-i> 

n  —  0 


7.  y{t)~  -2[t  +  -^-z  + 


2n 


t 


11 


+ 


t 


16 


5-6  5-6- 10-11  5-6- 10* 11  *  15* 16 


+  . . .  ] 


„  ,  ,  .  A/2  A(4-A)/4  A(4  — A)(8  — A)/6  _ 

9.  (a)  y\{t)—  1  -  - ^ - ^ - +  ... 


4! 


6! 


+  a 


(/+i 


(b)T2(0='+(2-A)^+(2-A)(6-A)^  +  (2-A)(6-A)(10-A)^  +  ... 

11.  (a)y,(/)=l-«2^-«2(22-a2)^ 

—  a2(22  —  a2)(42  —  a2)  —  +  . . . 

o! 

72(0  =  t + (l2  -  a2)  +  (l2  -  a2)(32  -  a2)  +  . . . 

13.  (a)^(/)=l-i/2-i/3+^/4+...  (b)>-(i)w0.8592436 
15.  (a)>-(/)=l-i/2-i/3  +  i/4+...  (b)y (|)»0.86087198 
17.  (a)T(0  =  3  +  5/-4/2  +  /3-|/4+...  (b)^(3)^5.3409005 


Section  2.8.1 
1.  y(t)  =  clt  +  c2/t5 
3.  y(t)  =  cl(t-l)+ c2(t-l)2 
5.  y(t)  =  /(c,  4-  c2ln /) 

7.  y(t)  =  c,cos(ln /)+  c2sin(ln /) 
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Answers  to  odd-numbered  exercises 


9.  y(t)  = 


2\[2 


Section  2.8.2 
1.  Yes  3.  Yes  5.  No 


7.XO  =  ^(l-*-g-3^-3.5.4! 


+  C^'/2(1  +  I+5T2! +F?W+  •' 

9.  y(t)  =  0^+21  +  ^ +  c2t'M\  +  ^  +  ~^ 


23*  3 !  *  5 • 7 


+ 


11.  y(t) 


,  .  3/  ,  3 2t2  , 

Cj  I  1  +  +  „  _  „ .  + 


3,3 


37 


1-3  3-7*2!  3-7-11-3! 

373 


+ 


+  c  tw*\  j  +  +  _JLi — |__ 

5  5-9-2!  5-9-13-3! 


+ 


13.  yx(t) 


=t5M 


t2  t 4 

H - 1 - - - h 

\  2-5  2-4-5-7 


4  2-4-6-8 


15.  >,1(0  =  e'2/2,>-2(0  =  r3  +  y  +  ^  +  ^:^  + 
17.  >>,(0  =  y  -  1,  J2(0  =  y  [*-'  -  (1  -  0] 


l9-  (e)72(0  =  '3e  '/ ^  dt 


Af2 

21.  (b)  >-,(;)  =  t\ 2-3/  +  -  ^rr  + 


2! 


3!  4! 


(c)  ^(O  =  .Vi(0  / -frr-  <* 

y  J'i  (0 


23.  (c)  y2(t)  =  J0(t)f 


dt 


tj2(t) 


25.  (b)X0  =  l 


Xt 


(1  -A)?2 


+ 


+ 


00  OOW 

(~  A)(l  —  A)-  •  •  ( n  —  1 

(h!)2 


Al,. 


+ 
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Answers  to  odd-numbered  exercises 


27.  (b)  yx(t )  —  1+2/4- 12  +  -rr  +  tt  + 


15  14 


yi(t)  =t 


=  t'/ 2 


89 1 


+ 


94\t‘ 


5/  \lt2 

1  +  "6"  +  _60~+  (60)(21)  '  (36) (21) (60) 


+ 


Section  2.8.3 

1.  Ti(0  =  1  +4/  +4 12  +  •  •  • 


176 


8i  +  l2r2  +  — 13  + 


27 


00 


3.  (b)  /,(0  =  i*  2 


(-1)"/ 


n,2n 


„  =  o  22”(«  +  !)!«! 


(c)  y2(t)  =  -Jl(t)lnt  +  ~ 


00 


1-  2 


(-o^+gr1 

)2n 


i2« 


„  =  i  22"« !(«  —  1)! 


Section  2.9 

1.  \  3. 

s2 


s  —  a 


( s  —  a )  +62 


5.  iU  + 


2  I  -s  s2+4tf2 


7.  i 


fl  +  Z>  ,  a  —  b 
*r 


^  [  s2  +  (<z  +  Z>)2  s2  +  (a  —  Z>)2  J 

15.  ^(/)=2e'-ie4,-ie2'  17.  y(s)  = 

2f2  +  i +  2 


.s2  +  6j  +  6 


19.  y(i)  = 


(j2  +  3s  +  7)(s2+1) 


(*+0 

23.  y(0=  -?«'  +  ie2' 


2C  ~  6C 


Section  2.10 


1. 


n\ 


,n  + 1 


3. 


las 


5. 


7 T 


7.  (a)  *  - 


(j2  + tf2) 

arc  tan  (b)  In 


J_5^ 

Ss3 


vr 


Vs2  +  a: 


;  (c)  In 


■s  —  Z> 

5  — J 


11. 


u  V57  t  ,  3 

cosh  — x - 1 - 


V57 


sinh 


V57  f 


9.  -4+2c“'  + 


■3,/2  13.  i(3  -  t)t2e 


>2^  / 


15.  ^(cosf-te  ')  19.  >>(/)  =  cos/  +  §sinr- ^rcosr  21.  >>(r)=£/V 


23.  >>(/)  =  1  4-  e  '  + 


_ „  V3  t  1  .  V3  t 

cos  — ^ - - sin  — x — 


V3 


-r/2 
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Answers  to  odd-numbered  exercises 


Section  2. 1 1 


1.  y(t)- 
3.  y( /)  = 

5.  y(t)  = 


(2  +  3t)e~'  +  2H3(t)[(t  —  5)  +  (t  —  l)e_<'_3)] 

3cos2/  — sin2/  +  j(l  — cos2 /)—  2(1  —  cos2(/  —  4))HA(t) 

3cosr  — sin/  +  jtsinM-  j//„/2(0[(; ~  2,r)s'n,  — cos,l 


7-  y{t)=h  (7/-l)+  cos 


V27  t 


V27  / 


V27 


-</2 


[  r--  ('-2) 

-H2(t){7t-\)-H2(t)  13cosV27  — ^ — 


V27  (r-2)  1 

+  V27  sin - 1 - -  e-«-»/2 


9.  T(0=^'  +  7/l(0[2  +  t  +  (2/-5)c'-|]-//2(0[l  +  t  +  (2/-7)e'-2] 


Section  2.12 

3.  (a) y(t)  =  (cosh  j t  —  3 sinh  j t)e3l/2  —  2//2(l)sinh  I(t  —  2)e3('~2,/2 

5.  T(0=Ksin^  —  tcost)  —  H„(t)smt 

7.  ^(0  =  3te“'  +  2<2c“'  +  3W3(0(t-3)e-('-3) 


Section  2.13 

,  eb'  —  eal  .  a  sin  at  —  b  sin  bt  -  sinai  —  atcosat  _  ,  •  . 

b-a  a2  —  b2  2a 

9.  jlsint  11.  (/  —  2)  +  (/  +  2)e~'  13.  y(t)  =  t  +  §sin2f  15.t(0=2/2 

17.  T(0={sin'+(l-i0e“' 


Section  2.14 

1.  x(t)—  c,e3'  +  3c2e4',  y(t)  =  cle3'  +  2c2e‘" 

3.  x(t)=  [(c,  +  c2)  sin  t  +  (c,  —  c2)  cos  t]e~2‘, y(t)  =  [ci  cos  t  +  c2  sin  t]e~2‘ 

5.  x(0=|e3'  +  2e"',  y(t)=\e3‘-\e~‘ 

7.  ;t(/)  =  cos/  e~\  >>(/)  =  (2  cos /  + sin /)e-' 

9.  ;c(/)  =  2(/cos/  +  3/sin/  +  sin/)e',  >>(/)= —2/sin/ e' 

11.  x(t )—  — 4sin/— y/sin/  — /cos/  +  5cos/ln(sec/  +  tan/), 

>>  (/)=—/  sin  t  —  t  cos  t  —  \  sin2 /  cos  /  —  5  sin  /  cos  /  +  5  sin2 / 

—  j  sin3 /  +  5(cos  t  —  sin  /)  ln(sec  /  +  tan  /) 

Section  2.15 

1.  y(t)=*clet  +  c2e~t  +  c3e2t  3.  y(t)  =  (ci  +  c2t  +  c3t2)e2‘ +  c4e~‘  5.  y(t)=0 
7.  >>(0“  -3-2/-  jt2  +  (3-t)e‘  9.  *//(/)=  1  -cos/-lncos/-sin/ln(sec/  +  tan/) 
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Answers  to  odd-numbered  exercises 


11.  i (/(t)=-^Y  f'[sin(t-s)/V2  cosh(/  —  s)/V2 

-cos(t-s)/V2  sinh(/-i)/V2  ^g(s)ds 

13.  ^(0  =  5<(e_2'-l)-}sint  15.  >K0=  4/2[(i  “ iOcos' +  (4  +  6<_Tl,2)sin/] 

17.  + 


Chapter  3 


Section  3.1 


Section  3.2 

1.  Yes  3.  No  5.  No  7.  Yes  9.  No  11.  Yes 


Section  3.3 

1.  Linearly  dependent  3.  Linearly  independent 
5.  (a)  Linearly  dependent;  (b)  Linearly  dependent 
7.  (b)y,(0=e',  y2( t)  =  e~'  9.pl(t)=t-l,  p2(t)=t2-6 


11.  x1  =  I  - 


17.  x,  =y, 

*2=  -j=-(y2~yi) 

V2 

-^(yi+yi) 

V2 


Section  3.4 


1.  x'(t)= 


x2(0= 


cosV3  t/2  e  ,/2 

(~>sV3t/2-iV3  sinV3  t/2)e~'/2 
sin  V3  t/2  e~'!2 

(§cosV3  t/2—  jsinV3  t/ 2)e~,/'2 
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Answers  to  odd-numbered  exercises 


3.  x'(0  =  |®j,  x2(/)  =  |2^,j  5.  Yes  7.  Yes  9.  No  11.  (b)  No 

Section  3.5 

3.  —48  5.  0  7.  —97  11.  No  solutions 


Section  3.7 


7.  No  solutions  9.  x=  |  0  |  11.  A=  1,  —  1 

\0/ 


Section  3.8 


l.  *(0=ci( j )e3'  +  c2( 2 )^4/  3.  x(/)  =  c,|  oje  '  +  c2|  2^e  '  +  c3|lj<?8' 


5.  x(0=c,l  oje-,0'+  C2(l)  +  C2(oJ  g5' 
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Answers  to  odd-numbered  exercises 


9.  x(0=|  -2  je2'  11.  x(/)=|  0  je-2' 

13.  (b)x(/)  =  9|oje/-1  +  2  1 


Section  3.9 


—  V2  sin  V2  t  —  V2  cos  V2  t 
cos  V2  t  —  V2  sin  V2  t 
—  3  cos  V2  / 


Section  3.10 


j  f  7e'-  5e2'  +  8e-/  -  e‘+  5e2‘-  4e~t  3e‘-  5e2t-2e~r 

13*  0>)  1q  21e/  —  5e2t—  \6e~t  —  3el  +  5e2t+  $e~*  9el—  5e2t  —  4e~t 

1 14e'  + 10e2'  — 24e_/  —  2et  —  \0e2t  +  \2e~‘  6e'  +  10e2/  —  6e~^ 

15.  1+  — 

a 

Section  3.11 

j  e~2' +  5e2' —  e3‘  5e2'  —  5e3‘  e~2‘  —  e3' 

1.  j  -e-2,  +  e3,  5g3,  _ e-2l  +  e3, 

1  4e~2‘  -  5e2‘  +  e3'  -5e2'  +  5e3‘  4e~2'  +  e 3'. 
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Answers  to  odd-numbered  exercises 


3  — 
A  5 


6e2/  — cos/  —  2  sin/  —  2e2'  +  2cos/+4sin/ 

3e2t  —  3cos/  —  sin/  —  e2/+6cos/  +  2sin/ 

5  sin  t  —  lOsin/ 


5. 


(t+\)e  1  (/  +  \)e  t  —  e 


—  te 
te~* 


3  1 

7.  A=|  1  3 

3  3 


-/  .  ^  —  2/ 


—  te  +e 

-t 


te 


2t  e  1  —  e  2/ 
€~2*  —  €~t 

„-t 


-1 

-1 

-1 


9.  A  = 


13 


16 

8 

0 


-25 

-6 

13 


le2t  — 2  cos /  + sin/ 

e2t  —  cos/  +  3sin/ 
5  cost 


30 

-24  I  11.  No 
26 


Section  3.12 

1.  x(/)  =  2e'('cos'  +  3,shl'  +  sin^ 
V  —/sin/  / 


3.  x(/)  = 


—  4sin/  —  ^/sin/  —  /cos/  +  5cos/ln(sec/  +  tan/) 

—  /  sin  /  —  /  cos  /  +  sin  /  cos2 t  —  {  sin2  /  cos  / 

—  5  sin  /  cos  /  +  5  sin2/  —  \  sin3  / 

+  5(cos  /  —  sin  /)  ln(sec  /  +  tan  /) 


5.  x(/)  = 


3e3t —2  e2t  —  te2t 
e2i 

3e3t  —  le2t 


ii.  m 


4 1  '  2*  '  8  1 

4*  *  T  8  I 


13.  iKO  =  I  -2  17.  *(0=1-1  Ife 


3/ 


Section  3.13 

l.x(, )-«-(’)+.*(  ~l) 

3.x(0--f(J)«-'+|(5p'+|(")-<(|)+3f])«- 

l  —  /  —  /2 


5.  x(/)  =  er| 


3,2 


r  x/^_2^i//cos/4-3/sin/  +  sin/\ 
W  V  -/sin/  / 


9.  x(/)  = 


/  <  7T 


/  cos  2/  +  sin  2/ \ 

V  2  sin  2/  )9 

(  cos  2/  \ 

Vsin2/  +  cos2//  ^ 


1-/ 

11.  x(/)=(  t  \e2t 
t 


/—  /2—  -/3  —  -/4+  — /5 
it  6  *  2l  T  12* 

i  i 

13.  x(/)  = 

15.  x(/)  = 

i+/ 

i 

i2+i<3 

*  ^ 

[  1  +  2/  J 

.3/ 
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Answers  to  odd-numbered  exercises 


Chapter  4 

Section  4.1 

I.  x=0, >>=0;  x  =  0,y  =  \;  x=  1,^  =  0;  x=j,y=i\ 

3.  *=0^=0,z  =  0;  x=5,/=|,z=-(^  +  ^j 

5.  x =0,  y  =y0;  y0  arb.  7.  x =0,  y  =  —  2,  z  —  m 
x  =  x0,  y  =  1 ;  x0  arb. 
x  =  x0, y=  —  \;  x0  arb. 

Section  4.2 

1.  Stable  3.  Unstable  5.  Asymptotically  stable 
7.  Asymptotically  stable  9.  Stable 

II.  x(t)  =  1  is  stable;  x(/)=0  is  unstable. 

Section  4.3 

I.  x = 0,  y  =  0  is  unstable;  x  =  1 ,  y  =  0  is  unstable; 

x  =  —  l,y  =  0  is  unstable;  x=0,  y  =  2^4  is  stable; 
x = 0,  y  =  —  21/4  is  stable. 

3.  x  =  0,  y  =  1  is  unstable;  x = 0,  y  =  -  1  is  unstable; 
x  =  1,  y  =  0  is  unstable;  x  =  —  1,  y  =  0  is  stable. 

5.  x=0,  y  —  m  is  unstable  for  all  integers  n. 

7.  Unstable  9.  Unstable  11.  Stable  13.  Stable 

15.  Unstable  17.  Unstable 

Section  4.4 

1.  y =cos(x  —  l)2  3.  y  =  tanx  5.  x2+y2  =  c2 

7.  The  points  x  =  x0,  y  =y0  with  x0+y0=  —  1,  and  the  circles  x 2+y2  =  c2,  minus 
these  points. 

9.  The  points  x  —  0,y  =y0;  x  =  x0,  y  =  0,  and  the  curves  y  =  ce  “(V3)*3*,  *  >  0; 
y  =  ce  _(2/3)e3*,  x  <  0. 

II.  The  points  x=0,  y=y0,  and  the  curves 

{ad—  be) 

( by  —  dx ) - ^ - ln|c  —  dy\  =  k,  x>0 

(ad—  be) 

{by  —  dx) - ^ - \n\c  —  dy\  =  k,  x<0 

13.  The  point  x  —  0,  y  =  0;  and  the  curves  xy 2  —  j  x3  =  c. 

Section  4.5.1 

3.  —  — In (b  +  cy)=  +  k  5.  ln(6  +  —  Ar  ln(a  +  ex)  + 

c  ia  a  c 
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Answers  to  odd-numbered  exercises 


Section  4.7 
13.  (b )y2  =  x2  +  x4+c2 

Section  4.8  ‘ 

5.  The  circle  x2+y2  = 
point. 

7.  The  circles  x2+y2 
Section  4.9 

1.  c  —  1,1 

3.  e=-2,2 

5.  No  bifurcation  points 

Section  4.10 

1.  x  =  0,y=0  is  unstable;  x  =  {a/ e)9y  =  0  is  stable  if  ad<ec,  and  unstable  if 
ad  >ec;  x  =  (af+  be) /( ef  +  bd\  y  =  {ad—  ec) /{bd  +  ef)  exists  only  for  ad  >  ec 
and  is  stable. 

3.  xx  =  0,  x2  =  0,  y  =  0;  xl  =  c/d,  x2  =  nc /[d {ax  +  a2)],  y  —  na2  —  a2  —  axa2; 
assuming  that  na2 >  a2  +  a xa2. 

(ii)>,|=  1’-*'2=o;  (c)«2=i 

Section  4.12 

3.  (a)  rI+XI2/2  =  ylnS  —  rS  +  c;  (b)  Yes 

5.  (a)  0.24705;  (b)  0.356;  (c)  0.45212;  (d)  0.60025;  (e)  0.74305;  (f)  0.77661 
Section  4. 1 3 

7.  Either  x(t)—>0  or  .*(/)-»— -g — -  (for  /?,<:  —  a,  >0) 


=  1.  Note  that  every  point  on  this  circle  is  an  equilibrium 
=  (2n+  1)tt/2 


Chapter  5 


Section  5.1 

(2n+  1  )2w2 
1.  X„  = - — - ,>'(x)  =  csin 


(2/Z  -I-  1)7TJC 
m  21 


'l  \  .  \  —n27r2  ,  v  mrx 

3.  A  =  0, y  —  c;  A= — - — ,  y(x)  =  ccos— r 

r  * 

5.  y (x)  =  c sinh  V— Aq  x ,  where  sinh  V  —  A0  w  =  V-A0  cosh  V-Aq 
y  (x)  =  c  sin  x9  where  tan  it  =  \f\n  . 

7.  A=  — 1  >y{x)  =  cex\  A  =  /i2,  y{x)  =  c[n cos nx  +  simnx] 
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Answers  to  odd-numbered  exercises 


Section  5.3 

1.  u(x ,  t)  =  sin  j 7 Txe~  1/71*V4 +  3  sin  f ttx e  -O-71^2*/4 
3.  w(x,  0  =  3  sin  27rx:  e ^  ~ 4,r2^  —  7  sin 4ttx:  e(1  ~ 16,r2)/ 

5.  w  (/,>>)  =  e2(/+-y)  +  ^_3(/+>,) 

7.  w(/,^)  =  e“5/^_4>'  +  2e""7^_6-v  —  \4enteUy 
9.  (&)X"-iiX=0;  Y”-(il+\)Y= 0;  T'-\a2T= 0; 

(b)  /)  =  sin  sin  e a2"2 7r2(7>2  ~  a2)‘/a2b2 


Section  5.4 

-  x  4  f  sinTTJt  ,  sin37r^c  ,  sin57rx 


i-  /w=  ^ 

7T 

3.  /(*)=!’ 

7T 


.A  M  ✓V  *  I**  XXX.  «•  «TV  *  V/AAA 

T  +  3  +  5 


+  . . . 


2  r  sin7rx  sin277\x:  sin37rx 

7T  1  2  3 


5  c  /  \  3  1  i  Tvn 

.  f(x)=  -7 - y  -  Sin  ^r-  cos 

*/v/4  tt  ",  n  2 

n—  1 


mr  mx 


1 1  tvrr  \  • 
^1  -cos-^-Jsin 


mx 

2 


e‘-  1  »  e/(—  I)"—  1  r  nm 

7./W- +  ,!+„v  ['«*- 


mx  .  fl77\X 

— ; - «77Sin — r- 


7TX  ' 
/  . 


9.  /(*) 


»  /(e'(-i)--i) 


mx 


e  ,  ~  v<  v  '  m. 

l  /2+«V  I 

e‘-\  .  •  (-iyv-e-')r. 


tt  \  e7—  1  S  (  0  \e  e  )  [ .  mx  .  mx  1 

11.  f(x) - —  +  £  l2  +  n 2„2  [  1  «*  —  +  ™ Sin  —  J 


13.  f(x)  =  |  sin x  —  £  sin  3x 


^2  oo  (— H^COS/ZX 

17.  (a)/(x)=  ^-+4  2  - j - 

•j  ,  n 

n  =  1  n 


Section  5.5 


1.  /(*)= 


e  — 

1  00  2(1— COSWr/^) 

e 

■+.?,  .-V 

COS  tTTTX 

3a 

oo  4a(coswr/2  —  1) 

mx 

4 

+  2  22 
n- 1  nir 

cos  — 
2a 

5 •  /(*)=  7 +  2  -4r2cosmr/2  —  1— (— l)"]cos^7^- 

4  w  ' 


7. /(*)=§  2  ^(coswr/2  — (— l)")sin 


7 r  ,  /Z 

n  =  1 


•  mx 

m~T 


9.  /(x)  =  sin2x 


n  /  \  •  2,4  cos2x  .  cos4jc  .  A  ^ 

11.  (a)  sin.x=  —  +  —  - -  H - -  +  ...  ;  0<*<1; 

7T  7T  [  1  —  22  1  — 42 

(b)  cosjc=  -[  2sln2x  +  4sin4*  +  ...  1;  0<  jc<  1;  (c)  No 
■tr  j2  —  1  42  —  1 
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Answers  to  odd-numbered  exercises 


Section  5.6 

'  A»0  ^ 

oo  2n(l -(-!)" cos  10) 


(b)  u(x,t)=  S 

/7  =  1 


_ oin  WTX  -0.86?!2 77^/100. 

nV- 100  10  * 


00 


(c)  «(*,/)=  ioo  2 

/I*8 1 


4sinwr/2  (—  1) 


n  n 


„2  _2 
/J  7T 


AMT 


cm  ^-0.86n2iA/100. 

sm  1(X)  e 


(d)  u(xj)  = 


130  £  ((“!)" -cos3w/10) 


m 


n  =  1 


n 


cm  ^-0.86/»27t2//100 

sm  1Q  e 


5.  (b)  u(x,  /)  =  lOx  +  800  f  -mr2/2  sin^V-"2^' 

/»=1  «27T2  1U 


Section  5.7 

4  ^ 

1.  u(x,t)=-  2 


2ai  +  1 


7rn=oL(2A2+l)2-4  2W+1  J 


sin(2Ai  + 1)  ^  cos(2/j  + 1)  y 


a  12  v  sin2«7r/3  .  n7rx  met 

3.  «(x,/)=—  2,  - ^ - sin-^— cos— j— 

mt 


_2 

77  wasl 
oo 


at 


AZ77  .  TVHX 


m  ,  8^1.  mr  . 

5.  k(x,/)«  —  2  -T  Sin -y  sin  - 

77  /i«i  n  L  1 


cos 


2  ““  2  ~~  10 

9.  i/(x,^,0=A'(x)y(/)7’(0;  K(y)  =  a2cos  V^-^y  +  ^sin 


m2  y 


^  (x)= aj  cos/ix  +  blsinnx  T(t)  =  a3  cos  Xct  4*  b3  sinXcf 


Section  5.8 

mix  — a)  nirv 

1.  u(x,y)=  2  c„sinh - s — -sin  —  ;  c„  = 


-2 


n  =  \ 
oo 


b  ’ 


3/  \  mx  .  * 

•  #(*0')“  2d  cnsm~~^~  sinh 


mx  .  .  rnr(y-b) 


/!=  1 


a 


c  = 


6  sinh  n?ra/  6 

-2 


/>>: 


.  .  AMjy  , 

isin-^— dy 


’  71  tfsinhAwr^/a 


jf7(*)si 


b 

•  AWT*  , 

sin - ax 


a 


x  sin(2n-l)^sinh(2n-l)^ 

Mv)-:2 


77  „  =  1  (2/j  —  l)sinh(2/j-  \)irb/a 

4  oo  sinh(2n-l)^sin(2/i-l)^ 

77  (2ai  —  l)sinh(2/j—  \)ira/b 

x  —  a  . 


Try 


qq  sinh(2n  — l)7r--r-^sin(2Ai  — l)-r 

7.«(v)-l  +  ;2 


9.  H(x,.y)  = 


!  (2aj  —  l)sinh(2/z  —  \)ira/b 
—  2  «  (—  O^sinh  Vl  +  «2?r2  *  sin  wry 


7T 


«  =  l  «  sinh  V 1  +  n2  7T2 


11.  2T"+XA'=0;  Y"  +  nY=0 ;  Z"-(X  +  ju)Z=0 
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Answers  to  odd-numbered  exercises 


Section  6.3 

1.  p0(x)  =  ^-,Pl(x)  =  ^-X,  p2(x) 


12 

T 


Section  6.4 

7.  y(x)  =  csinh^— X0x,  where  sinh^  —  X0n  =  y/  —  X0  cosh^/  —  XQn\ 

y(x)  =  csin^/I„x,  where  tan^/I^  =  yfXn 
9.  X  =  —  1,  y(x)  =  cex\  X  =  n2,  y(x)  =  c[n cos nx  +  sin nx ] 
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Index 


Adj  oint  matrix  315 
Analytic  functions  189 
Asymptotic  stability  381 
Atomic  waste  problem  46 
Autonomous  systems  376 

Bernoulli  equation  67 
Bessel  equation  137 
order  v  218 
order  zero  220 
Bessel  function 
Ut)  218 
J0(t)  220 

Boundary  conditions  for 
elastic  string  483 
heat  equation  482 
Laplace  equation  483 
second  order  o.d.e.  476,  533 
Boundary-value  problem  476,  483,  533 
heat  equation  483 
Laplace  equation  508 
wave  equation  503 

Carbon  dating  19 
Cauchy  ratio  test  189 
Cayley-Hamilton  theorem  351 
Characteristic  equation  138 
complex  roots  141 
real  and  equal  roots  145 
real  and  unequal  roots  138 
Characteristic  polynomial  334 
complex  roots  341 
real  distinct  roots  334 
repeated  roots  345 


Combat  models  405 
Complex  exponentials  141 
Compound  interest  57 
Convolution  integral  252 


Damping  force  166 
Detecting  art  forgeries  1 1 
Detection  of  diabetes  178 
Difference  equations  91 
Differential  operators  129 
Diffusion  equation  481 
Dirac,  P.  A.  M.  244 
Dirac  delta  function  244 
Laplace  transform  of  246 
Direction  field  397 
Dirichlet  problem  483 
Discontinuous  forcing  function  238 


Eigenfunctions  478 
Eigenvalues  of 

boundary-value  problem  478,  533 
matrix  333 
multiplicity  of  347 
Eigenvectors  of  a  matrix  333 
linear  independence  of  335 
Elastic  membrane 
vibration  of  482 
Elastic  string  482,  503 
boundary  conditions  for  483 
fundamental  frequencies  of  505 
initial  conditions  for  483 
nonzero  initial  displacement  505 
propagation  of  initial  conditions  506 
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Index 


Electrical  circuits  175 
Epidemic  models  38 
gonorrhea  491 
plagues  39 
threshold  theorem  460 
Equilibrium  points  373 
Error 

effect  of  step  size  102 
for  Euler’s  method  100 
formula  101 

for  improved  Euler’s  method  110 
round-off  105 

for  Runge-Kutta  method  113 
for  three- term  Taylor  series  107 
Escape  velocity  52 
Euler  equation  145,  149,  198 
Euler  method  97 
formula  error  101 
Even  functions  493 
Exact  equations  58,  60 
Existence  and  uniqueness  theorems  for 
first-order  equations  67 
orbits  314 

second-order  equations  129 
systems  of  first-order  equations  291,  414 
Extension  of  a  function 
even  495 
odd  496 

Finding  roots  by  iteration  81 
First-order  differential  equations  1,  2 
exact  58, 60 

existence  and  uniqueness  theorem  for  67 

general  solutions  of  4,  21 

homogeneous  3 

initial-value  problem  5,  7,  21 

integrating  factor  for  8,  64 

linear  3 

nonlinear  3 

numerical  solutions  of,  see  Numerical 
methods 
separable  21 
several  variables  264 
systems  of  265 
Fourier  series  487,  488 
convergence  of  487, 488 
cosine  series  494 
Fourier  coefficients  489 
Parseval  identity  492 
sine  series  494 
Frobenius  method  203 
Fundamental  matrix  355 
Fundamental  set  of  solutions  133 

General  solutions 
first-order  linear  equations  4,  21 
higher-order  equations  259 
second-order  equations  132,  142, 148 
systems  of  first-order  equations  334 


Generalized  functions  249 
Gompertzian  relation  53 
Gonorrhea  model  491 
Green’s  theorem  437 


Heat  equation  481,499 
bar  with  insulated  ends  500 
boundary  conditions  483 
initial  conditions  483 
nonhomogeneous  boundary  conditions 
503 

smoothing  of  discontinuities  in  initial 
conditions  499 

Hermite  equation  197,  515,  539 
Hermite  polynomial  197,  541 
Higher-order  equations  259 
general  solution  259 
variation  of  parameters  262 
Homogeneous  equations  25 
Hypergeometric  equations  218 


Identity  matrix  311 
Impedance  177 
Improved  Euler  method  109 
Indicial  equation  209 
Initial  conditions  482 
for  elastic  string  483 
for  heat  equation  483 
propagation  of  initial  conditions  for  wave 
equation  506 

smoothing  of  discontinuities  for  heat 
equation  499 

Initial- value  problem  5,  7,  21, 128,  259 
Inner  product  516 
Integral  equation  70 
Integrating  factor  8,  64 
Inverse  matrix  316 


Jump  discontinuities  227,  238 
Judicious  guessing  of  particular  solutions 
157 


Kirchoff’s  second  law  176 


Laguerre  equation  218,  539 
Laplace  equation  482,  508 
boundary  conditions  483 
Dirichlet  problem  483 
Neumann  problem  483 
Laplace  transform  225 
of  the  convolution  252 
definition  226 
of  derivatives  229 
of  the  Dirac  delta  function  246 
existence  of  227 
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Index 


inverse  of  230 
of  systems  368 
Legendre  equation  197,  539 
Legendre  polynomials  197 
Limit  cycles  436 
Linear  algebra  271,  296 
Linear  independence  136,  281 
of  eigenvectors  335 
of  vector  functions  293 
of  vectors  281 
Linear  operators  131 
Linear  differential  equations,  see  Firstr-order 
differential  equations,  Second-order 
differential  equations,  or  Systems 
of  first-order  differential  equations 
Linear  transformations  324 
Logistic  curves  30,  43 
Logistic  law  of  population  growth  28 

Malthusian  law  of  population  growth  27 
Matrices  267 
addition  of  276 
adjoint  315 

characteristic  polynomial  334 
determinant  of  299 
diagonal  301 
lower  301 
upper  301 
eigenvalues  of  333 
eigenvectors  of  333 
fundamental  355 
Hermitian  530 
identity  311 
inverse  316 
product  of  310 
selfadjoint  530 
Mechanical  vibrations  165 
damped  forced  vibrations  169 
damped  free  vibrations  167 
forced  free  vibrations  170 
free  vibrations  166 
resonant  frequency  173 
Method  of  elimination  257 
Mixing  problems  53 

Natural  frequency  171 
Neumann  problem  483 
Newtonian  mechanics  46, 127, 166 
Newton’s  method  88 
Nonlinear  differential  equations 
autonomous  system  376 
Numerical  methods  95 
effect  of  step  size  102 
error  100, 102, 107, 110, 113 
Euler  97 
formula  error  101 
improved  Euler  110 
Runge-Kutta  113 
three-term  Taylor  series  107 


Odd  functions  493 
Ohm’s  law  176 
Orbits  394, 397 

Order  of  a  differential  equation  1 
Ordinary  differential  equations 
definition  1 


Parseval’s  identity  492 
Partial  differential  equations 
definition  481 
Particular  solution  151,  364 
Periodic  solutions  416 
Phase  plane  394 
Phase  portrait  418,  419 
Picard  iterates  70 
Piecewise  continuous  functions  227 
Poincare- Bendixson  theorem  433 
Population  models  27 
competitive  exclusion  451 
Logistic  law  of  population  growth  28 
Malthusian  law  of  population  growth  27 
sharks  443, 444 
Potential  equation  482 
Power  series  188 


Qualitative  theory  372 


Radioactive  dating  12 
Reduction  of  order  147 
Reduction  to  systems  of  equations  265 
Regular  singular  points  204 
Resonance  170 
Resonant  frequency  173 
Reynolds  number  50 
Runge-Kutta  method  113 


Schwartz,  Laurent  245,  248 
Second-order  differential  equations  127, 138 
characteristic  equation  138 
complex  roots  141 
real  and  equal  roots  145 
real  and  unequal  roots  138 
Euler  equation  145, 150, 198 
existence  and  uniqueness  theorem  129 
fundamental  set  of  solutions  133 
general  solutions  132,  142,  148 
homogeneous  equation  128 
judicious  guessing  of  particular  solution 
157 

nonhomogeneous  equation  151 
discontinuous  function  238 
particular  solution  151 
reduction  of  order  147 
series  solution  185,  188 
singular  points  198,  204 
variation  of  parameters  155 
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Index 


Separable  equations  20 
Separation  of  variables  483,  484 
for  heat  equation  484,  500 
for  Laplace  equation  508,  510 
for  wave  equation  503 
Sequence  of  eigenvalues  478 
Series  solution  185, 188 
recurrence  formula  187 
when  roots  of  the  indicial  equation  are 
equal  212, 216 

when  roots  of  the  indicial  equation  differ 
by  an  integer  212,  216 
Singular  points  198,  204 
Solutions,  definition  1 
first-order  equations  4,  21,  60 
higher-order  equations  259 
second-order  equations  132,  142,  148 
systems  of  first-order  equations  334 
Spread  of  technological  innovations  39 
Spring-mass  dashpot  system  165 
damped  vibrations  167,  169 
forced  vibrations  170 
free  vibrations  166 
Stability 

asymptotic  381 
of  equilibrium  points  385,  386 
linear  systems  378 
of  a  solution  373,  378 
Sturm- Liouville  boundary  value  problem 
533 

Systems  of  algebraic  equations  297 
Systems  of  linear  first-order  equations  265 
characteristic  polynomial  334 
complex  roots  341 
real  distinct  roots  334 
repeated  roots  345 
definition  265 


existence  and  uniqueness  291 
fundamental  matrix  355 
general  solution  of  334 
nonhomogeneous  360 
reduction  to  265 
stability  of  378 
variation  of  parameters  360 


Tacoma  Bridge  disaster  173 
Taylor  series  189 
Tchebycheff  equation  197,539 
Tchebycheff  polynomials  197,  541 
Theory  of  war  398 
Thermal  diffusivity  481 
Three-term  Taylor  series  106 
Trajectories  394 
Tumor  growth  52 


Variation  of  parameters  155 
for  systems  of  equations  360 
Vectors  266 

linear  independence  of  281 
solutions  of  systems  of  equations  297 
valued  functions  266 
Vector  spaces  273 
basis  287 
dimension  of  283 
Volterra,  Vito  444 


Wave  equation  482,  503 
boundary  conditions  483 
initial  conditions  483 
solution  504 
Wronskian  133 
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